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CHAPTER  I 


Introduction 

1.1  Introduction  and  Summary 

A major  area  of  continuing  statistical  research  has  been  the  develop- 
ment of  procedures  to  test  the  conformity  of  data  to  a given  hypothesis. 
Such  procedures  are  known  as  goodness  of  fit  tests.  The  hypothesis  of  in- 
terest usually  stems  from  the  assumption  that  the  sample  population  is 
characterized  by  the  same  properties  which  characterize  a particular  dis- 
tribution (simple  hypothesis)  or  a particular  family  of  distributions 
(composite  hypothesis).  Goodness  of  fit  tests  are  useful  in  demonstrating 
by  means  of  formal  significance  statements  that  such  hypotheses  are  inade- 
quate to  explain  the  observations,  or,  in  the  absence  of  such  significance 
statements,  providing  the  impetus  for  further  data  analysis  based  on  other, 
parametric,  statistical  techniques. 

In  this  thesis,  we  are  concerned  with  tests  of  the  composite  null 
hypothesis  that  the  distribution  function  of  the  observed  sample  is  a 
member  of  a specified  scale- location  parameter  family,  where  the  scale  and 
location  parameters  are  unknown  and  unspecified.  We  first  study  a test  of 
this  hypothesis  proposed  by  Weiss  [41].  Then  we  turn  our  attention  to  the 
most  common  special  case — that  of  normality — by  proposing  and  analyzing 
several  tests  which  are  large-sample  analogs  of  the  most  efficacious  (based 
on  empirical  evidence)  small-sample  procedures.  All  of  our  results  are 
asymptotic,  as  the  sample  size  n tends  to  infinity,  and  all  of  the  test 
criteria  which  we  examine  are  based  on  functions  of  an  increasing  subset  of 
the  ordered  sample  values. 
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By  increasing  subset,  we  mean  that  as  the  sample  size  tends  to  infinity, 
the  number  of  order  statistics  which  we  consider  also  tends  to  infinity, 
by  assuming  certain  relationships  between  the  number  of  samp.’e  quantiles 
in  the  selected  subset,  the  separation  between  those  quantiles,  the  rate 
at  which  the  quantiles  move  into  the  tails  of  the  distribution,  and  the 
smoothness  and  size  of  the  tails  of  the  density  function  of  a standard 
representative  of  the  hypothesized  scale-location  parameter  family,  we  are 
able  to  assert  that  the  selected  subset  of  sample  quantiles  is  asymptoti- 
cally jointly  normally  distributed  with  known  mean  vector  and  covariance 
matrix.  The  motivation  behind  choosing  a gradually  increasing  subset  is 
that  we  achieve  a workable  distribution  theory  while  retaining  "most"  of 
the  information  contained  in  the  sample. 

The  Weiss  procedure  is  based  on  a quadratic  form  of  the  sample  quan- 
tiles which  has  an  asymptotic  chi-square  distribution.  Weiss  proved  that, 
under  the  null  hypothesis,  replacing  the  unknown  scale  and  location  para- 
meters by  estimates  obtained  from  the  sample  does  not  alter  the  asymptotic 
distribution  theory,  and  he  also  proved  that  his  procedure  is  consistent. 

In  Chapter  II,  we  examine  the  asymptotic  power  of  the  test  under  sequences 

«.  0 

of  contiguous  alternatives  of  the  form  H (x)  = G( — r — 1)  + 6 (x),  where 

» °2  n 

G(y)  is  the  standard  representative  of  the  scale-location  parameter  family 
appearing  in  the  null  hypothesis  and  the  "disturbance  function"  en(x) 
satisfies  certain  regularity  conditions.  By  letting  the  sequence  (en(x)} 
approach  zero  at  a certain  rate  and  with  respect  to  a certain  measure  of 
distance,  we  obtain  a non- trivial  power  for  the  test. 

In  section  2.1,  we  explicitly  define  what  we  choose  as  the  selected 
subset  of  order  statistics,  including  those  assumptions  required  for  the 
asymptotic  distribution  theory.  We  also  discuss  the  concept  of  asymptotic 
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indistinguishability  of  sequences  of  distribution  functions  which  provides 
the  basis  for  that  theory.  In  section  2,  we  describe  the  Weiss  test  of 
fit.  In  section  2.3,  we  introduce  the  sequence  {H^Cx)},  state  the  assump- 
tions which  the  functions  {en(x)}  must  satisfy  to  yield  a non-trivial 
power,  and  derive  several  useful  results  based  on  those  assumptions,  in- 
cluding the  joint  asymptotic  normality  of  the  selected  order  statistics 
under  the  sequence  of  alternatives.  Section  2.4  contains  the  derivation 
of  the  asymptotic  power  of  the  test.  Finally,  in  section  2.5  we  apply 
the  results  of  section  2.4  to  two  specific  cases — a sequence  of  Weibull 
distributions  which  approaches  the  exponential  distribution  and  a sequence 
of  contaminated  normal  distributions. 

In  Chapter  III,  we  turn  to  the  composite  hypothesis  of  normality. 

Using  the  same  approach  of  selecting  a gradually  increasing  subset  of  order 
, statistics,  we  develop  large-sample  analogs  of  the  Wilk-Shapiro  and 

Shapiro-Francia  tests,  two  of  the  most  effective  (.small  samples)  but  also 
least  understood  tests  for  normality.  We  show  that  the  analog  of  the 
Wilk-Shapiro  statistic,  which  we  call  W(n),  is  asymptotically  lormally 
distributed  under  the  null  hypothesis  as  the  sample  size  tends  to  infinity. 
We  also  show  that,  asymptotically,  the  analog  of  the  Shapiro-Francia 
statistic,  which  we  call  W*(n),  suitably  standardized,  has  the  same  dis- 
tribution under  Hq  as  a certain  weighted  sum  of  independent  chi-square 
random  variables.  We  further  prove  that  the  tests  based  on  ft(n)  and  vV*( n) 
are  consistent.  Finally,  we  study  the  behavior  of  those  tests  under  the 
sequence  of  alternatives  (Hn(x)}  of  Chapter  II,  except  that  here  the 
* sequences  of  disturbance  functions  must  satisfy  different  assumptions  re- 

garding the  rate  and  manner  of  approach  to  the  null  hypothesis  in  order 
to  guarantee  non-trivial  powers.  We  find  that  the  measures  of  distance, 


... 
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or  metrics,  for  the  tests  based  on  W(n)  and  W*(n)  are  quite  complicated. 
As  a consequence  of  the  power  studies,  we  propose  and  analyze  two  add.i- 

ft  * 

tional  statistics,  W(n)  and  W^'Cn),  which  yield  improved  tests  for 
normality.  We  also  find  that,  contrary  to  most  small-sample  empirical 
studies,  the  analogs  of  the  Shapiro- Franc ia  test  have  better  asymptotic 
power  than  the  analogs  of  the  Wilk-Shapiro  test,  and  we  show  by  example 
that  the  analogs  of  the  Wilk-Shapiro  test  can  be  biased,  at  least  if  the 
selected  subset  of  order  statistics  is  not  chosen  carefully. 

Summarizing  Chapter  III,  section  3.1  contains  a survey  of  the  litera- 
ture regarding  tests  for  normality.  In  section  3.2,  we  define  the  statis- 
% 

tic  W(n)  and  determine  its  asymptotic  distribution  under  the  null 
hypothesis.  We  also  state  in  section  3.2.4  the  null  hypothesis  result  for 

'V, 

W*(n),  deferring  the  proof  to  Appendix  B since  it  depends  on  several 
results  which  are  not  used  elsewhere  in  the  thesis.  In  section  3.3,  we 
prove  that  the  W-test  is  consistent.  In  section  3.4,  we  analyze  the 

'V. 

power  of  the  W-test  under  the  sequence  of  alternatives  (H^Cx)}.  The 

% 

analogous  results  for  W*(n),  and  are  given  in  section  3.5.  In  section 
3.6,  we  illustrate  the  foregoing  results  using  the  example  of  a sequence 
of  contaminated  normal  distributions  and  also  compare  the  tests  for 
normality  with  the  Weiss  test  discussed  in  Chapter  II.  Example  3.2  illus- 

*\j  'V 

trates  tue  bias  of  W(n)  and  ^(n).  Finally,  in  section  3.7,  we  discuss 
possible  areas  of  future  research. 

In  Appendix  A,  we  present  the  results  of  a Monte  Carlo  study  of  the 
Weiss  statistic  under  the  null  hypothesis.  In  Appendix  B,  we  present  a 
linear  transformation  of  the  selected  subset  of  order  statistics  to  stan- 
dard normal  random  variables  and  derive  the  asymptotic  distribution  of 
% 

W*(n)  under  the  null  hypothesis  as  mentioned  above.  In  Appendix  C we 


summarize  for  reference  purposes  various  formulas  concerning  the  inverse 
normal  distribution  function  which  are  used  frequently  in  Chapters  II  and 


The  remainder  of  this  introduction  is  devoted  to  a brief  discussion 
of  some  of  the  commonly  used  goodness  of  fit  tests  for  continuous  distri- 
butions, with  emphasis  on  tests  for  the  null  hypothesis  that  the  sample 
distribution  is  an  unspecified  member  of  a specified  scale- location 
parameter  family. 

1.2  EPF  Tests  of  fit 

A frequently  used  class  of  tests  for  simple  or  composite  hypotheses 
about  continuous  distributions  is  the  class  of  EDF  tests,  so-named  because 
they  are  based  on  differences  between  the  hypothesized  distribution  G(x) 
and  the  empirical  distribution  function  Gn(x).  Included  in  this  class 
are  tests  based  on  the  following  statistics: 

Kolmogorov-Smirnov : D = sup  1 Gn ( x ) - G(x)| 

00 

Cramer-von  Mises:  W2  = n/[G  (x)  - G(x)]2dG(x) 

-00 

CO  _ 

Anderson- Darling : A2  = n/[Gn(x)  - G(x)]2[G(x)(l  - G(x))]  ciG(x) 

-00 

Letting  F(x)  denote  the  unknown  distribution  function  of  the  sam- 
ple population,  research  on  the  above  tests  has  been  primarily  devoted  to 
the  following  cases,  classified  by  the  null  hypotheses  about  F(x):  (G(x) 

is  continuous  in  all  cases) 

Case  0:  F(x)  = G(x)  completely  specified 

Case  1:  F(x)  = G(x;0)  where  0 is  a scalar-valued  nuisance  parameter 

Case  1A:  G( . ) is  normal  with  unknown  mean 

Case  IB:  G(.)  is  normal  with  unknown  variance 

Case  1C:  G(.)  is  exponential  with  unknown  scale  parameter 
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Case  2:  F(x)  = GCCx-e^J/Q^)  where  0^  and  0^  are  nuisance  location 

and  scale  parameters 

Case  2A:  G( . ) is  normal  with  unknown  mean  and  variance 
Case  2B:  G(.)  is  exponential  with  unknown  scale  and  location  para- 
meters (Durbin  [12]  shows  how  this  case  can  be  transformed  to  case 
1C  above. ) 

Case  0 is  the  classical  case  and  has  been  extensively  studied.  The 

sampling  distributions  and  asymptotic  distributions  of  the  statistics  D, 

2 2 

W , and  A under  the  null  hypothesis  are  known  to  be  independent  of  the 
hypothesized  G(x).  A summary  of  the  rv - Its  and  references  appear  in 
Darling  [7].  Expressions  for  the  powers  of  .e  above  EDF  tests  do  not 
exist  in  closed  form.  Durbin  [11]  has  shown  how  the  power  may  be  approxi- 
mated in  the  case  of  D by  computing  boundary  crossing  probabilities 

for  the  tied-down  Wiener  process,  and  he  has  examined  the  asymptotic  power 

-1/2 

under  contiguous  alternatives  with  densities  of  the  form  hn(x)  = 1 + rn 
when  G(x)  is  the  uniform  distribution.  Empirical  results  for  the 
powers  of  the  various  tests  are  scattered  throughout  the  literature. 

When  nuisance  parameters  are  present,  the  EDF  tests  are  no  onger 
distribution- free.  The  test  based  on  D has  been  studied  under  the  null 
hypothesis  for  case  2 by  Serfling  and  Wood  [24]  and  cases  1 and  2 (and 
the  more  general  p-nuisance  parameter  case  as  well)  by  Durbin  [12]. 

Serfling  and  Wood  prove  the  weak  convergence  of  the  distribution  function 
of  the  statistic  to  that  of  a certain  Gaussian  process  with  a covariance 
function  which  depends  on  G(x)  and  the  parameters  being  estimated. 

Durbin  has  developed  techniques  for  inverting  Fourier  transforms  to  obtain 
the  distribution  of  D for  finite  n and  has  worked  out  case  1C  in  detail. 


-*-L, 
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Using  Monte  Carlo  methods,  Lilliefors  [16,17]  constructed  tables  of  criti- 
cal points  for  the  Kolmogorov-Smirnov  test  in  cases  1C  and  2A.  He  found 
that  the  statistic  D tended  to  be  stochastically  smaller  when  the  unknown 

parameters  were  estimated  from  the  sample. 

2 

The  statistic  W has  been  studied  under  the  null  hypothesis  m 

case  1 by  Darling  [6],  in  case  2A  by  Kac,  Kiefer,  and  Wolfowitz  [15],  and 
2 2 

noth  W and  A have  been  studied  by  Stephens  [34]  for  cases  1A  - 1C  and 
2A.  Briefly,  the  asymptotic  distribution  of  each  of  the  statistics  under 
the  null  hypothesis  is  the  same  as  that  of  the  random  variable 


J Y2(t)dt 
0 


where  Y(t)  is  an  appropriate  tied-down  Gaussian  process  with  zero  mean 
and  covariance  function  depending  on  the  statistic,  the  hypothesized  dis- 
tribution and  the  parameters  being  estimated.  This  distribution  is  the 
same  as  that  of 


l Z./A. 
? i i 


where  are  i.i.d.  chi-square  random  variables  with  1 degree  of  freedom 

and  A^  are  the  eigenvalues  of  a certain  integral  equation.  In  [30], 

2 2 

Stephens  gives  modifications  of  the  statistics  D,W  , and  A for  cases  0, 
1A-1C,  and  2A  so  that  the  asymptotic  critical  points  of  the  associated 
tests  may  be  used  for  finite  sample  sizes. 

Kac,  et.  al.  studied  the  asymptotic  power  of  tests  based  on  D and 

2 

W by  examining  the  ways  in  which  those  tests  measure  distance,  showing, 
in  particular,  that  those  tests  are  hyper- efficient  relative  to  the  optimum 


1 


\ 


i 

f 


I 


I 

l 

t 

i 


8 


chi-square  test.  Stephens  [30],  in  empirical  power  studies  on  tests 
2 2 

based  on  D,  W , A , and  two  other  EDF  statistics,  found  that  it  was 
better  not  to  have  the  true  parameter  values  available  in  conducting  the 
tests.  Apparently,  in  trying  to  fit  a density  of  a certain  shape  to  the 
data,  being  tied  down  to  fixed  parameter  values,  even  correct  ones,  is 
more  a hindrance  than  a help. 

1.3  Other  Tests 

The  oldest  well-established  test  of  fit  is  the  chi-square  test.  It 
is  particularly  well  suited  to  the  case  where  F(x)  is  discrete,  and  it 
is  known  how  to  adapt  the  statistic  to  the  case  when  there  are  nuisance 
parameters  which  must  be  estimated  from  the  sample.  When  F(x)  is 
continuous,  however,  chi-square  tests  are  neither  consistent  nor  do  they 
perform  as  well  in  general  as  the  EDF  tests  mentioned  above.  A unified 
summary  of  chi-square  theory  appears  in  the  paper  by  Moore  and  Spruill  [19]. 

Sample  spacings,  or  differences  between  successive  ordered  sample 
values,  have  provided  the  basis  for  many  tests  of  fit.  A comprehensive 
review  of  sample  spacings  theory,  associated  tests  of  fit,  and  the  related 
literature  is  given  by  Pyke  [20].  Under  the  null  hypothesis  that  F(x) 
is  uniform,  Cibisov  [4]  proved  that  the  Kolmogorov- Smirnov  test  is  hyper- 
efficient relative  to  any  test  symmetric  in  the  sample  spacings  under 

sequences  of  continuous,  contiguous  alternatives  of  the  form 
-1/2 

h^(x)  = 1 + r(x)n  . However,  in  his  review  of  Cibisov's  paper,  Weiss 
[37]  exhibited  a sequence  of  alternatives  such  that  the  reverse  is  true. 
Sethuraman  and  Rao  [25]  studied  the  Pitman  (asymptotic  relative)  effi- 
ciencies of  various  tests  based  on  sample  spacings  for  alternatives 
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h(x)=l+r(x)n  for  6 >1/4.  And  Blumenthal  [2]  studied  the  large- 
n n = 

sample  behavior  of  tests  based  on  spacings  when  nuisance  location  and 
scale  parameters  are  present. 

Finally,  there  are  many  tests  of  fit  which  have  been  developed  to 
test  a specific  null  hypothesis,  the  most  common  being  that  of  normality. 
A detailed  survey  of  tests  for  normality  is  presented  in  section  3.1. 

Most  of  these  tests  share  the  property  of  location  and  scale  invariance, 
thus  eliminating  the  need  for  estimators  of  the  nuisance  parameters  when 
the  null  hypothesis  is  composite. 


CHAPTER  II 

ON  THE  WEISS  TEST  OF  FIT 

2.0  Introduction 

Fox'  each  n,  X^n),  X2(n),...,  X^Cn)  are  independent,  identically 
distributed  random  variables  with  distribution  function  F(x).  F(x)  is 
known  to  be  absolutely  continuous  with  density  function  denoted  by  f(x) 
but  is  otherwise  unknown.  Formally,  the  decision  problem  of  interest  is 
the  following: 

Problem  P^:  Using  X^n) ,X2(n) , . . . ,Xn(n) , test  the  hypothesis 

Hq:  F(x)  = G((x-01)/02)  for  some  01502  ©2  > 0 

vs. 

H^:  F(x)  i G((x-01)/02)  for  any  01,02  wi-th  02  > 0 

Thus,  we  are  interested  in  determining  whether  or  not  F(x)  belongs 
to  a specified  scale-location  parameter  family.  The  function  G(x)  is  a 
standardized  representative  of  that  family;  the  nuisance  parameters  0.^ 
and  02  are  unknown  and  unspecified. 

The  test  statistic  which  we  will  be  concerned  with  has  been  proposed 
and  analyzed  under  the  null  hypothesis  by  Weiss  [41]  and  is  based  upon  a 
certain  subset  of  the  ordered  sample  values  Y^(n)  < Y^n)  <.. . . Yn(n). 

In  this  chapter,  we  determine  the  asymptotic  distribution  of  Weiss’  test 
under  sequences  of  alternatives  which  approach  the  null  hypothesis  as  n 
tends  to  infinity. 

In  section  2.1,  we  define  certain  parameters  and  state  certain 
assumptions  which  those  parameters  must  satisfy  in  order  to  guarantee 
desirable  asymptotic  proterties  of  the  test  statistic.  We  also  discuss 
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the  concept  of  asymptotic  indistinguishability  of  sequences  of  distribution 
functions  and  its  applications  to  our  testing  problem. 

In  section  2.2,  we  describe  the  Weiss  test  of  fit. 

In  section  2.3,  we  introduce  a class  of  alternative  distributions 
which  approach  the  null  hypothesis  as  n -*■  00 . We  state  assumptions  on 
the  manner  and  rate  of  approach  to  the  null  hypothesis  which  the  sequence 
of  alternatives  must  satisfy  in  order  to  yield  a non-trivial  power  in  the 
limit,  and  we  derive  several  useful  results  based  on  those  assumptions. 

In  section  2.4,  we  derive  the  asymptotic  power  of  the  test. 

In  section  2.5,  we  give  examples  to  illustrate  the  results  of  the 
preceding  sections. 

2 . 1 Preliminaries 

2.1.1  Definitions  and  Assumptions 

For  each  n,  let  pn,  0 < p^  < 1/2,  be  such  that  npn  is  an  integer. 

Further,  let  K and  L be  integers  such  that 
n n 

K L = n - 2np  . (2.1) 

n n *n 

Define  = glb{x:  G(x)  > 0}  and  B 2 = lub{x:  G(x)  < 1}.  (B^  may 

be  -00;  B2  may  be  +».)  Define  x as 

inf  (g(x) : G_1(pn)  4x4  G-1(l-pn)} 


where  g(x)  denotes  the  density  corresponding  to  G(x).  We  assume  the 
following: 


n-*°  p K 
rn  n 


Lim 


■n-x»  nx 


K 


Lim 


n 


1/2  3 
n-*x>  n x 

n 


Lim 


n 


. r3/2^2 

n-'»  L x 
n n 


= 0 


= 0 


= 0 


g(x)  < D < “ for  all  x in  (B^,B2) 


g'(x)  exists  and  |g'(x)|  < D..  < °°  for  all  x 


m 


(bi,b2) 


I x I g( x ) < D2  < « for  all  x in  (B^B^ 

There  exist  values  ‘’j.’S  w*tl1  ®1  = \ < S = ®2 

0 < T < g(x)  for  all  x in 

g(x)  is  nondecreasing  in  (B^,^) 
g(x)  is  nonincreasing  in  (?2,  B2) 

B2 

1 CygjSv\-]2g(y)dy  < 00 
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2.1.2  Some  Relations  Based  on  the  Assumptions 

The  assumptions  (2.1)-  (2.11)  imply  other  relations  which  will  also 
be  required.  These  relations  together  with  the  assumptions  implying 
them  are  summarized  in  Table  2.1.  Verifying  these  relations  is  straight- 
forward and  will  not  be  done  here. 

2.1.3  Notation 

Throughout  the  chapter  we  will  use  bars  over  the  names  of  random 
variables  which  are  used  to  illustrate  various  properties  or  represent 
certain  distributions  in  order  to  distinguish  them  from  those  random 
variables  directly  connected  with  the  problem  pn>  Primes  will  often 
be  used  to  indicate  differentiation  of  functions;  they  will  also  be  used 
to  indicate  transposes  of  vectors.  Since  no  vectors  will  be  differenti- 
ated, no  confusion  should  arise.  In  order  to  eliminate  many  unnecessary 
variable  names,  o(.),  °p(-)>  and  0(.)  notation  will  be  used  throughout. 
We  write  "x(n)  is  o(y(n))"  if  x(n)/y(n)  converges  to  zero  as  n 
Similarly,  "x(n)  is  o^(y(n))"  if  x(n)/y(n)  converges  to  zero  in  prob- 
ability as  n + Mx(n)  is  0(y(n))"  if  x(n)/y(n)  is  finite  as 
n «>.  As  is  traditional,  we  use  <Kx)  and  $(x)  to  represent  the 
c.d.f.  and  p.d.f.,  respectively,  of  the  standard  normal  (or  N(0,1))  dis- 
tribution. Finally,  in  order  to  simplify  notation,  the  dependence  on 
n of  X1(n), . . . ,Xn(n)  and  Y^(n) , . . . ,Yn(n)  will  be  suppressed,  and  we 

will  write  X,,...,X  and  Y, ,...,Y  instead. 

1’  5 n 1’  ’ n 

2.1.4  The  Joint  Asymptotic  Normality  of  an  Increasing  Number  of  Order 
Statistics 

The  joint  asymptotic  normality  of  a fixed  set  of  suitably  standard- 
ized sample  quantiles  from  a given  continous  distribution  is  well  known 
(see  e.g.  David  [8,ch.  9]).  The  result  below  is  a generalization  to 
the  case  where:  (1)  the  number  of  sample  quantiles  is  no  longer  fixed 
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TABLE  2.1 

IMPLICATIONS  OF  THE  ASSUMPTIONS 


Relation 

No. 

Implied  By 

L 

Lim  — = 0 
. np 
n-*» 

(2.12) 

(2. 1)6(2. 3) 

Lim  o /o  - 0 

«■*»  L3/2 

n 

(2.13) 

(2. 6)6(2. 7) 

K 

Lim  -r-rk — = 0 

n-*°  u x 
n n 

(2.14) 

(2.6) 

Lim  — ^ = 0 
n-x»  nx 

n 

(2.15) 

(2.4) 

Lim  = 0 

n-x»  L t 
n n 

(2.16) 

(2. 4)6(2. 6) 

Lim  ~ 0 

r 3 6 

n~x°  L x 
n n 

(2.17) 

(2. 4)6(2. 6) 

Lim  /2  4 = 0 

n-x»  L x 
n n 

(2.18) 

(2. 4)6(2. 6) 

l ^gM1  gU)i*  ‘ " 

(2.19) 

(2.11) 

B2 

J x^g('x"j;j  §(x)dx  < “ 

(2.20) 

(2.11) 

but  is  instead  allowed  to  depend  on  n and  slowly  increase  as  n does; 
and  (2)  the  underlying  distribution  is  replaced  by  a sequence  of  under- 
lying distributions,  one  for  each  value  of  n.  We  require  the  following 
definition: 

Definition  2.1.  Suppose  for  each  n,  F (x, x,  , ,)  and  G (x„ ,...,xw  s 

r ’ n 1*  k(n)  n V k(n) 

ariT>  joint  cumulative  distribution  functions  for  k(n)-dimensional 
random  variables.  The  sequences  (Fn)  and  {G^}  are  Asymptotically 
Indistinguishable  ( A . I . ) if 


Lim  |pp  {(X1,...,Xk(n))  eR(n)}  - PG  { (X^ . . . ,XR(n) ) eR(n)}|  = 0 


n-*» 


for  any  sequence  of  measurable  regions  (R(n)}  of  k(n)-dimensional 
Euclidean  space. 

Suppose  fn(x^». • • an<i  gn^xl’ ‘ * ‘ jXk(n) ^ are  probability  density 
functions  corresponding  to  F^  and  Gfi  respectively.  The  following  con- 
dition (Weiss  [38])  is  necessary  and  sufficient  for  Asymptotic  Indistinguish- 
ability: 


n 1’  ’ k(n) 

gn(Xl’",,Xk(n) 

assuming  (X^,. 


) 

y converges  stochastically  to  1 as  n 00 

. .,X,  , .)  has  distribution  F for  each 
k(n)  n 


n. 


(By  symmetry,  we  can  replace  F^  by  G^  and/or  invert  the  above  ratio 
without  affecting  the  result.)  Definition  2.1  defines  an  equivalence 
relation,  and  we  will  say  that  the  random  vectors  corresponding  to  (Fn) 
and  (G(n)}  are  asymptotically  equivalent  when  {F^}  and  {G^}  are  A. I. 


( 
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Now,  for  each  n,  let  X1}...,X  be  i.i.d.  random  variables  with 
absolutely  continuous  distribution  F^Cx).  Denote  by  Y^,...,Yn  the 
corresponding  order  statistics.  Let  satisfy  (2.1)  through 

(2.6)  with  replaced  by 

Tn  = inf  (fn(x):  F;1(pn)  < x < F^(l-pn)}, 


and  assume  (2.7)  and  (2.8)  are  satisfied  for  all  n with  g(x)  replaced 
by  f^(x)  and  and  replaced  by  = glb{x:  Ffi(x)  > 0)  and 

]j  = lub{x:  F^(x)  < 1}  respectively.  Note  that  (2.12)  - (2.18)  also 
hold.  Additionally,  assume  the  following: 


<n/pn)1/2f„<C(»n>)lV 

n-x» 


(2.21) 


and 


Lim  (n/pn)1/2fn(F"1(l-pn))[B2  - F^(l-pn)3  = ».  (2.22) 


n-*°° 


Finally,  define  Z^(n)  as 


np  +( j-l)L  _ 

/JT  f (F_( — —)  )C Y 


n n n 


np  +(j-l)L 
- F ~( — -)3 


np  +(j-l)L  n n 

rn  J n 


for  j = 1,2,..., K^tl . 

Then  Weiss  [393  has  shown  that  the  distribution  of  the  (Kn+l)-vector 

Z(n)  = (Zx(n), . . . ,ZK  (n))  is  asymptotically  equivalent  to  the  (Kn+1)- 

n 

variate  normal  distribution  with  mean  vector  0 and  covariance  matrix 
£(n)  = [oij(n)3,  where 
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L 

a (n)  = (pn  + (i-l)Ln/n)(l  - Pn  - (j-l)Ln/n)  (2.23) 

3 n 1 

for  1 _<  i < j < K +1. 

We  remark  that  the  result  of  Weiss  [39]  is  actually  proved  in  more 
generality  than  is  stated  above.  In  particular,  Weiss  allows  for  the 
possibility  that  the  bounds  and  B ^ on  the  support  of  Fn(x)  may  depend 
on  n,  as  might  the  upper  bound  on  f^tx).  However,  for  our  purposes 

these  extensions  are  unnecessary.  Also,  returning  to  the  assumptions  of 
section  2.1.1,  Weiss  [41]  has  shown  that  assumptions  (2.9)  and  (2.10) 
together  with  (2.1)  - (2.8)  are  sufficient  to  insure  that  (2.21)  and  (2.22) 
hold.  Hence,  the  asymptotic  normality  result  holds  with  fn(x)  and  Fn(x) 
replaced  by  g(x)  and  G(x),  respectively,  in  the  definition  of  Z(n). 

Reiss  [21]  has  further  generalized  the  above  results  (in  terms  of 
the  conditions  placed  on  the  "spacing"  of  the  indices  of  the  selected 
order  statistics)  and  obtained  Barry-Esseen  type  bounds  on  the  rate  at 
which  the  asymptotic  normality  takes  hold. 

Our  motivation  for  considering  an  increasing  subset  of  order  statistics 
as  defined  above  is  the  desire  to  have  a manageable  distribution  theory 
while,  at  the  same  time,  not  having  to  sacrifice  too  much  of  the  informa- 
tion contained  in  the  sample.  If  we  consider  a fixed  (finite)  number  of 
sample  quantiles,  we  have  an  asymptotic  distribution  theory  which  is  easy 
to  work  with,  but,  in  neglecting  the  remaining  order  statistics,  we  pay 
a high  price  in  terms  of  lost  information.  On  the  other  hand,  in  [40], 

Weiss  has  shown  for  a simple  hypothesis  that  an  increasing  subset  of 
o "* jc  statistics  defined  as  above  is  asymptotically  sufficient  for  the 
sample;  thus,  in  the  limit,  no  information  is  lost.  (Weiss  proves  the 
result  for  the  "equally  spaced"  case,  i.e.  = npn,  but  his  result  may 


Mu  -T mMt 


■ id,.  . /.A*. 


w 
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a 


be  easily  extended  to  the  case  = 0(nPn)  which  we  consider  here.) 


By  asymptotic  sufficiency,  we  mean  that  using  only  the  statistics 


{Yn  +(i_i)L  5 i=l,2, . . . jKn+l)  together  with  knowledge  of  the  distribu- 


tion G(x)  governing  the  population  from  which  the  sample  was  drawn. 


it  is  possible  to  generate  a random  n-vector  with  the  same  asymptotic 


distribution  as  the  complete  vector  of  order  statistics.  Furthermore, 


and  of  more  importance  in  studying  tests  of  fit,  the  same  results  holds 


if  Fn  rather  than  G is  the  true  distribution  function  for  the  sample. 


where  (F^  converges  to  G according  to  certain  regularity  conditions. 


2.1.5.  Some  Variances  Related  to  the  Multivariate  Normal  and  the  Inverse 


Covariance  Matrix  / (n). 


Assuming  Z(n)  = (Z, (n) , . . . ,Z„  ^.(n))  is  a (K  +l)-variate  normal 
± i\  +jl  n 


random  variable  with  mean  0 and  covariance  matrix  £(n),  (2.23)  implies 


var(Z1(n))  = var(ZK  +1(n))  = pn(l  +o(l)), 


(2.2*0 


var(Zj (n)  - Zj_1(n))  = (Ln/n)(l  +o(l)) 


(2.25) 


and , for  i < j , 


cov(Z.(n)-Z.  ,(n),Z.  (n)-Z._1(n))  = - ^(I^/n)2. 

1 “ 3 n” 


(.2.26) 


Futhermore,  for  1 <.  j <_  Kn+1,  var(Z^(n))  is  bounded  by  a finite  constant 


independent  of  j and  n.  We  will  frequently  have  occasion  to  refer  to 


the  inverse  covariance  matrix  £ ^(n).  It  is  easily  verified  that  £ ^(n) 


is  given  by  c J,1  (n),  where  c = n(L  -1)L  ^ and  J\(n)  denotes  the 
n^l  n n n ‘■'1 
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(K  +l)x(K  +1)  matrix 
n n 


In  appendix  B,  we  describe  a transformation  of  the  vector  Z(n)  to  a 
vector  of  i.i.d.  standard  normal  random  variables.  In  particular,  we 
present  a nonsingular  matrix  B(n)  such  that  B'(n)£  1(n)  = 1^  the 
(Kn+l)x(Kn+l)  identity  matrix. 

2.2  Construction  of  the  Test 

Before  describing  the  test  statistic  for  the  problem  P , we  first 

introduce  some  further  simplifying  notation.  Denote  the  quantity 
np  +( j-l)L 

G-1( — 2-^ -)  by  U..(n)  and  g(U..(n))  by  u.(n)  for  j = 1.2,... 

if  Hq  is  true,  then  F(x)  = G((x-0^)/02)  for  all  x, 

F-1(t)  = 0X  + 02G_1(t)  (2.28) 

and 

f (F_1(t) ) = ■a-g(.G"1(t)) 


(.2.29) 
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for  all  t e (0,1).  Denote  ^ (Ynpn+( j-l)Ln  " 01  " e2Uj(n))  bY 

Zj(n;01,e2),  and  let  Z^jO^Qg),. . . ,ZK  ^(nje^Gg)). 

n -1 

Henceforth  assume  without  loss  of  generality  that  G (1/2)  = 0 and 
G b(3/4)  - G b(l/4)  = 1.  This  can  be  accomplished  by  linearly  trans- 
forming the  parameters  and  has  the  advantage  that  the  estimators  for  0^ 
and  02  which  we  will  be  using  will  be  free  of  constant  terms  which  depend 
on  G(x). 

Finally,  let  A1  = 0^  + and  A 2 = ©2B2  + 01  denote  the  bounds 
on  the  support  of  F(x)  under  H^. 

2.2.1  Estimating  the  Parameters 

A A 

For  each  n,  we  will  estimate  0^  and  02  by  0^(n)  and  02(n), 
respectively,  where  O^n)  is  the  sample  median  of  the  sample  (X^,...,Xn), 

A 

and  02(n)  is  the  sample  interquartile  range  (the  difference  between  the 
third  sample  quartile  and  the  first  sample  quartile;.  Denote 

A ___  ^ 

/nto^n)  - 0][)  by  R^n)  and  /n(02)  - 02)  by  R2(n). 

Then  R^n)  and  R2(n)  are  asymptotically  jointly  normally  distributed 
with  zero  means  and  finite  variances  (see,  e.g.,  David  [8,p.  201]). 
Alternatively,  we  can  write 


O^n)  = 01  + R^nJ/^n 

(2.30) 

0,,(n)  = 02  + R2(n)/v/n  . 


We  should  note  that  there  are  more  efficient  estimators  than  0^(n)  and 

A 

02(n),  but  the  latter  have  the  advantage  of  simplicity  and  at  the  same 
time  are  "efficient  enough"  for  asymptotic  purposes.  That  is. 


■t 
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asymptotically  we  only  require  that  0^(n)  and  62(0)  approach  0^ 

/V 

and  ©2  at  a certain  rate;  in  particular,  that  *^n  (0^(n)  - 0^)  and 

A 

^n(©2  (n)  - ©2)  are  finite  with  probability  one  as  n ■+  “. 

2.2.2  The  Test  Statistic  and  Critical  Region 

Under  the  null  hypothesis  with  0^  and  as  the  true  parameter 

values,  the  assumptions  (2.1)  - (2.10)  imply  that  Z(n;01}02)  is 
asymptotically  normal  with  mean  0 and  covariance  matrix  £(n).  Thus, 
the  quadratic  form 


M(n;01,02)  2 cnZ'(n;01,02)51(n)Z(n;01,02) 


is  asymptotically  equivalent  to  a chi-square  random  variable  with  Kfi+1 
degrees  of  freedom.  Standardizing  M(n;0^,02),  we  obtain  the  result  that 

M(n;01,0_)  - (K  +1) 

T(n;0  ,0  ) = 

/2(K  +1) 
n 

is  asymptotically  normal  with  mean  0 and  variance  1. 

The  statistic  T(n-,0l502)  is,  of  course,  not  suitable  for  testing 
the  hypothesis  of  problem  since  it  .involves  the  unknown  parameters 

0^  and  ©2*  In  [41],  however,  Weiss  proves  that  0^  and  62  can  be 

A A 

replaced  by  their  estimates  0^(n)  and  without  changing  the 

A A 

asymptotic  result.  The  computable  statistic  T(n;01502)  is  asymptoti- 
cally N(0,1).  (Weiss'  proof  is  done  under  assumptions  (2.4)  and  (2.7) 
(2.11)  for  specific  functions  pn,Kn,Ln,  but  a review  of  the  proof  indi 
cates  that  (2.1)  - (2.3)  and  (2.5)  - (2.6)  are  sufficient  to  guarantee 
the  result.)  Thus,  for  fixed  a e (0,1),  if  we  define  c^  by 
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c.  , 

,1-ot 

/ <}>(x)dx  = 1 - a, 

— CO 

a level-a  test  of  Hq  is  given  by 

A A 

Reject  Hq  iff  T(n;0^,02)  > c$  ]_  a*  (2.31) 

The  upper  tail  of  the  distribution  is  chosen  as  the  critical  region  be- 

A A 

cause  T(n;0l502)  tends  to  be  larger  under  alternatives  than  under  HQ. 
In  [41],  Weiss  also  shows  that  this  test  is  consistent  under  mild  regu- 
larity conditions.  By  this  we  mean  that  for  any  fixed  F(x)  in  the 
alternative  hypothesis,  the  power  of  the  test  (2.31)  converges  to  1 as 
n tends  to  infinity. 

2.3  A General  Class  of  Contiguous  Alternatives 

The  remainder  of  this  chapter  is  concerned  with  determining  the 
power  of  the  test  under  sequences  of  alternatives  which  approach  the 
null  hypothesis  as  n gets  large.  Such  alternatives  are  termed  "conti- 
guous." By  properly  specifying  the  rate  of  approach,  we  are  able  to 
exhibit  a power  in  the  open  interval  (a,l)  and  thus  provide  a basis 
of  comparison  between  (2.31)  and  other  known  tests. 

The  alternative  distributions  Hn(x)  which  we  will  study  have  the 

form 


Hn(x)  = G((x-O*)/0*)  + en(x) 


(2.32) 


where  e*  and  0*  are  given  constants,  0*  > 0 and  where,  denoting 


9*B^  + 9*  by  A*  and  + by  A*,  the  functions  en(x),  which 

which  we  will  refer  to  as  "disturbance  functions,"  satisfy  the  following: 


e"(x)  exists  and  is  finite  for  all  x in  (A*. A*) 
n 12 


(2.33) 


-G((x-6*)/9*)  i en(x)  <1-  G((x-e*)/e|)  for  all  x in  (A*, A*)  (2.34) 


e^(x)  = 0 for  all  x i (A*, A*) 


(2.35) 


f e'  (:<)dx  = 0 
• ...  n 

Af 


^(x)  =£=  “ 'p'  g((x-0|)/9|)  for  a11  x in  (Ay  ,A|) 


Condition  (2.33)  is  a regularity  condition.  Relations  (2.34)  through 

(2.37)  insure  that  H (x)  is  in  fact  a distribution  function.  In  most 

n 

applications,  we  will  specify  Hn(x)  rather  than  en(x),  and  hence, 
(2.34)  - (2.37)  will  automatically  be  satisfied. 

None  of  the  above  conditions  specifies  anything  about  the  rate  at 
which  the  sequence  of  alternatives  {Hn(x)>  approaches  G( (x-9*)/9jp . 

To  insure  that  the  asymptotic  power  will  fall  between  the  level  a of  the 
test  and  1,  we  define 


w(n)  = max{  |u1(n)U,  (n)| , [u^  +1(n),JK  +1(n)|) 


v1(n)  = 


e (H~1(l/2 ) ) 
n n 


s (H"1(3/4) ) e (rtl/U)) 

v2(n)  = ty IT 

g(G  i(3/4))  g(G  X(l/4)) 


and  add  the  requirements: 


Limsup 

n-Ho 


jl/2  1/4 
n n 
1/2 


j(n)|vi(n)|  = d^  i=l,2 


A*  9 

,,,  ,2  [£'(4)]' 

L“  V,7  L -iroo- dlt  ■ d! 

n-*»  A“  n 


(2.39) 


(2.40) 


Lim 

n-*» 


b(n)  sup 


e"(x) 

Irool 

n 


0 


for  some  increasing  sequence  b(n)  with 


(2.41) 


Lim  b(n) 
n-*=° 


00 

9 


Lim  — 
n-x”  n 


5/4 


3/4 


n 


b(n) 


0 


I] 

t: 

l 

i 


where  d^d^dg  are  nonnegative,  finite,  and  not  all  zero.  The  motiva- 
tion for  conditions  (2.39)  and  (2.40)  will  become  clear  in  the  derivation 
of  the  power  of  the  test.  Heuristically , the  integral  appearing  in  (2.40) 

A A 

is  the  "characteristic  metric"  for  the  test  statistic  T(n;0^,02).  Con- 

A A 

dition  (2.39)  requires  that  the  estimators  0^(n)  and  02(n)  converge 

sufficiently  rapidly  in  some  cases.  If  = -°°  and  B2  = +°°,  it  is 

-1/2 

easily  shown  that  p^  w(n)  converges  to  zero  as  n -»■  <».  In  that  case, 

(2.40)  implies  (2.39)  with  d^  = d^  = 0 (see  (2.42)  below).  Condition 

(2.41)  is  a regularity  condition  on  the  tails  of  ejj(x)  which  enables  us 
to  approximate  complicated  sums  by  much  simpler  integrals.  In  most  cases, 

(2.41)  will  also  be  implied  by  (2.40). 


'MBMtejmrrrmu  w— i 


*4-^  1 " WJJ'\ 


Lemma  2.1.  If  (2.40)  holds,  then 


Lim 

n-Kir. 


jl/2,.1/4  I , , x 

L K sup  e (x)  < d0 

n n n 1 = 3 


1/2 


A«<x<A" 


Lim 


L^K1'* 

" " s„«C  V»  = 0 


n-H»  p 


1/2  n n '*V 


Lim 


l1/2k1/4 

" n * <h;1u-p„»  = o 


n-*»  p 


1/2  n n *V 


Proof.  For  any  x e (A*,A"), 


L1/V/l'|«  (x)|  = ^\xh  II  e'(y)dy| 
n n 1 n I n n u..,.  n J 


, r 1/2, ,1/4  ? len(x)L  1/2, 

4 K K„  j “T7S h_  (x)dx 


n n ' ,1/2,  , n 

A1  hn  X 
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(2.42) 


(2.43) 


(2.44) 


A2  re'(x)T^  A2 

<(LK1/2  J2)Vfldx)^{i  wx)dx)l/2. 

= n n h w n 

A"  n A^ 


By  (2.40),  the  right  hand  side  approaches  d„  as  n 00 , from  which  (2.42) 


follows.  Using  the  same  method,  but  over  the  interval  (A*,Hn  (pn)),  we 
obtain  the  sharper  result  (2.43)  for  the  tail  behavior  of  e (x).  We  have 


1/2  1/4  , 1/2Hn  ^Pn^[e’ (x)]2  ,/?  Hn  *pn^ 

l1/2k1/4  £ (H-l(  ))|  < {L  k1/2  J ” — --i1/2 

n n n n pn  - n n A* 


’ \ **  / -J  -t  tr\  **  ** 

irno^x}  < i hn(*>dx> 

i 


1/2 
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Hn  ^Pn^[e' (x)]2 

. fr  „l/2  nf  LenWJ  J ,1/2  1/2 

< IL  K I --r-  -r-v — dx}  p 

= n n h (x)  *n 

A“  n 


which,  after  dividing  both  sides  by  and  taking  limits  as  n 00 , 
yields  (2.43).  Finally,  (2.44)  follows  in  exactly  the  same  manner  when 
the  interval  (H  ^(1-p  ) ,A*)  is  used.  This  completes  the  proof  of  the 
lemma . QED 


Lemma  2.2.  If  (2.40)  holds,  then 


Lim  L1/2K1/I||;  e°<H"'(t))  t'(G^(t»G-1(t)at|  < 
n n ^ . \ \ / / ,_  \ \ 


0 h (H  (t))  g(G  x(t)) 
n n 


Proof.  Using  the  same  method  as  in  the  proof  of  lemma  2.1,  we  have 


W'*  } en(C(t))  f (G-\t))B-l(t)dt|  < 

n n i,  , ,„-l 


0 hn(HnX(t))  g(G  (t)) 


(t)dt  | 4 


aK^2/[!^^fdt)1/2(jcg,(s^(t)V1(t)]2dt)1/2. 

"n  Oh  (H_1(t))  0 g(G  X(t)) 

n n 


Making  the  transformations  x = H (t)  in  the  first  integral  and 
y = G ^(t)  in  the  second  integral,  the  right  hand  side  becomes 


,A2[e'(x)32 


fL  i/l/2 f L nv  ^ J l/2ff  , g^y)^  ,1/2 

tLnKn  {...^rur^x}  {L[y  g(y)J  dy}  • 


A”  n 


Taking  limits  of  both  sides  as  n ->  » and  applying  (2.11)  and  (2.40), 


I 


the  result  follows. 


2.3.1  An  Expansion 

In  this  subsection,  we  establish  the  following  useful  relation: 

e (rtt)) 

H ±(t)  = e*  + 6*G"x(t)  - 0*— — r + A.(n,t)  (2.45) 

n 1 2 2g ( G— 1 ( t ) ) 1 

for  te(0,l),  where  A^(n,t)/e  (H  ^“(t))  goes  to  zero  as  n ~ for  all 
te(0,l).  Alternatively,  we  may  write  (2.45)  as 

g ( H~ 1 ( t ) - 0*  - 0*G_^(t))  = -e  (H_1(t ) ) (1  + o (D)  (2.45') 
n 12  n n 

or,  using  (2.42), 

g(G--^r-)-(H~1(t)  - 0*  - 0*G_1(t))  = -e  (H~1(t))  + o(l“1/2K"1/4) . (2.45") 
0"  n jl2  nn  nn 

To  prove  (2.45),  let  x = H^(t).  Then,  from  the  definition  of  Hn(x) 

t = Hn(x)  = G((x-0*)/0*)  + en(x)  = G((x-0*)/0|)  + ^V^t)), 

so  that 


x = 0*G_1(t 


sn(Hn1(t)))  + 0*. 


Expanding  G (t  - 


e (H~1(t))) 


in  a Taylor  series  about 


G-1(t) 


yields 
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_■)  , £„(*) 

G (t  - e (x))  = G-i(t) : 

g(G_i(t)) 


£n(x)  g^G"1^)) 

2 g3(G-1(£) ) 


for  some  5 e (t,t-e  (Hn^(t).  Substituting  the  above  into  the  expression 
for  x completes  the  proof. 

2.3.2  Asymptotic  Normality  of  Order  Statistics  and  the  Asymptotic 
Distributions  of  the  Estimators  under  the  Alternatives 

In  order  to  derive  the  asymptotic  power  of  the  test  (2.31)  under  the 
sequence  of  alternatives  (H^Cx)},  we  will  require  the  asymptotic  normal- 
ity of  the  (K+l)- vector  Z*(n)  = (Z*(n), . . . ,Z*  ^,(n)),  where  ZsV(n)  is 
n 1 K +1  1 

n J 

given  by 


_ _i  nPj-Kj-DL  np  +(j-l)L 

/nhjHl- n)[Y  . . - 1(~ -)]. 

n n n np  +(j-l)L  n n 

*n  J n 


Let  t-  denote  inf  h (x):  H (p  ) <.  x < H ^(l-p  ).  Then  the  asymptotic 
x 

normality  will  follow  if  (a)  relations  (2.4)  - (2.6)  are  satisfied  with 

t replaced  by  t*  for  all  feasible  (p  ,K  ,L  ) combinations,  and  (b) 
nr  Jn  rn’nn 

conditions  (2.21)  and  (2.22)  are  satisfied  with  F ^(.)  and  f (.)  re- 

n n 

placed  by  (.)  and  hn(.),  respectively.  A sufficient  condition  for 

both  (a)  and  (b)  is  that  t*/t  remains  finite  as  n ■>  00 . In  particular, 

n n 

it  is  sufficient  to  show  that 

h (H”1(pn)) 

Lim  8*-  — — = 1 (2.46) 

n-xo  g(G  (pn)) 


since  the  argument  for  the  other  tail  is  the  same.  Using  (2.45'),  we  have 


g(G  *(pn)) 


g(G  *(pn)) 


and  the  right  hand  side  can  be  written  using  a Taylor  series  expansion  as 


e (H“1(pT,))g'(G_1(pT,))  e (rtp  )) 


n — ~r r — - — + —2 — 2- — 2 — + smaller  order  terms. 

Cg(G_i(pn))]2  g(G_1(pn)) 


Now,  l y (2.41), 


e' (H~1(p  ) ) e'(H  1(pn))  h (H*~1(p  ) ) h (rtp)) 

n n rn  n n n n n n , n n rn 


g(G“-L(pn))  hn(H‘X(pn))  g(G_i(pn)) 


g(G"  (pn)) 


as  n -v  °°.  Also,  by  2.8  and  (2.43), 


|en(Hn1('’n))g'<G~I(Pn»|  £ lEn(Hn1(Pn»lDl 

8<G'1(P„))  " Tn 


1/2  1/4  2^  ’ 
n n n 


and  (2.1)  and  (2.6)  together  imply  that  the  latter  expression  converges 
to  zero  as  n tends  to  infinity.  Hence,  (2.46)  is  proved.  To  demon- 
strate that  (2.46)  and  its  counterpart  for  the  other  tail  imply  (2.21) 
and  (2.22),  note  that  (2.21)  can  be  written  in  this  case  as 
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j \ 
\ 

i \ 


r 

t 


Lim  (— ^•)1/2-h~  n,  (P-^))[g(6~1(p  ) )(B1  -G~1(p  ) )+9^g  (H~1(p  ) ) (H-o(l) )]  = 
n-x»  Pn  g(G_1(pn))  n 1 n 2 n n 

which  is  easily  seen  to  be  true  using  (2. 43), (2. 2),  and  the  fact  that  (2.21) 
is  satisfied  by  G(x). 

In  summary,  we  have  shown  that  the  random  vector  Z*(n)  is  asymptoti- 
cally equivalent  to  a (K^+D-variate  normal  random  variable  with  mean 
vector  zero  and  covariance  matrix  £( n ). 

A special  case  of  this  asymptotic  normality  yields  the  fact  that 

R*  = /n(0_  - H_1(l/2 ) ) and  R*  = /n(0o  - H_1(3/4)  + H~1(l/4))  are 
1 1 n 2 2 n n 

asymptotically  jointly  normal  with  zero  means  and  finite  variances.  Using 
(2.45),  recalling  that  G-1(l/2)  = 0 and  G_1(3/4)  - G-1(l/4)  = 1,  and 
neglecting  terms  which  are  asymptotically  of  smaller  order,  we  can  write 


0]L(n)  = 0*  + R|//n  + Ogv^n) 


(2.47) 

A 

02(n)  = + 0*v2(n) 


where  v^(n)  and  v2(n)  are  defined  by  (2.38). 


I 

l 


A 


2.4  The  Asymptotic  Power  of  the  Test 

In  this  section,  we  derive  the  asymptotic  distribution  of  the  test 

A A 

statistic  T(n;0^,02)  under  the  sequence  of  alternatives  {Hr(x)}.  In 

A A 

particular,  we  show  that  T(n;6^,02)  is  asymptotically  normally  distri- 
buted with  variance  one  and  finite,  nonnegative  mean  Y given  explicitly 

© 

in  lemma  2.5.  Knowing  the  value  of  the  mean,  the  power  is  then  readily 
computable  by  referring  to  tables  of  the  standard  normal  c.d.f. 


'1 
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Throughout  this  section  we  will  refer  to  the  random  vector  Z*(n) 
with  components  Z*(n),...,Z*  ,,(n)  as  defined  in  subsection  2.3.2.  We 

JL  t\  T± 

n 

-1  np  +( j-l)L 

will  denote  Hn  ( — ) by  U\!(n)  and  hn(U‘^(n))  by  u‘v(n).  In 

addition,  define 


X . (n)  = u.(n)/0*u*(n) 
J J Z J 


and  let  A(n)  denote  the  (Kn+1)  * (K  +1)  diagonal  matrix  whose  elements 
X..  are  given  by 


X. . 
13 


(n) 


i=j =1 > 2 , . . . , K^+l 

i^j 


Finally,  denote  the  quantity 

u.  (n) 

- e2  - e2u.(n» 

A A A A A A 

by  n^(n;0^,0.^)  and  the  (Kn+l)-vector  (n2.(n;0^,02), . . . ,n£  +-j_(n;0^,02) ) 

A A 

by  h(n;0p02). 

Using  the  above  notation  and  appropriately  arranging  terms,  it  is 

A A 

easily  verified  that  the  test  statistic  T(n;01,02),  given  by 
C2(Kn+l)]"1/2(cnZ'(n-,01,02));"1(n)Z(n;01,02)  - (Kn+1)) 
can  be  written  as 
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c 0*2 
n 2 


✓2(k  +i)  : 2 

n o 2 


Z5':' (n)A(n)^1i(n)A(n)Z>v(n)  - c^C^+l) 

+ 2Z*' (n)A(n)J“1(n)n(n;01,62)  ) 


+ n'(n;e1,e2)^"1(n)n(n;elJe2) 


A A 


(2.48) 


Expanding  A..(n)  as  in  the  preceding  section,  we  have 


Aj(n)  = 1 - 


en(Uv(n))g' (Uj(n)) 


Uj(n) 


t 6-^n) 


where  <S^(n)  represents  smaller  order  terms.  In  particular,  using  the 

-1/2  -1/4  -2 

results  of  the  preceding  section,  Aj(n)  = 1 + 0(Ln  Kn  Tn  ^ as  n m‘ 

Hence,  A(n)  can  be  written  as  I„  , , where  A„  , . is  a diagonal 

K tJL  K t_L  i\  t1 

n n n 

-1/2  -1/4  -2 

matrix  all  of  whose  diagonal  entries  are  0(L  K t ). 

n n n 

Using  the  above  representation  of  A(n)  together  with  the  definition 
(2.27)  of  I~1(n)  and  the  expansion  (2.45'),  it  is  straightforward  to 
verify  the  following  result  expressing  the  test  statistic  in  a form  whose 
asymptotic  properties  can  be  investigated. 


Lemma  2.3.  Under  the  assymptions  of  section  2.3,  the  test  statistic 

A A 

T(n;0l582)  is  given  by 


/2U<n+l)  022 


{W*(n)Cl+0(L"V1/2T"4)]  - c'1(K  +1) 
n n n n n 

-Q(n)[l+0(L“1/V1/V2)]  t V(n)]} 


(2.49) 
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where  W*(n)  is  given  by 


Z*,(n)X"1(n)Z*(n), 


(2.50) 


Q(n)  is  the  sum  of  the  following  six 


L 

n 

1/2 
n p 


Cu  (n)(v  (n)+v  (n))  + e (U*(n))(l+o(l))]Z*(n)  h 
112  n 1 1 


11  ^UK  +1^n^v1^n)+v2^n^  + en^UK  +1(n))(3-+°(1))3z^  +1(n) 
. n n n 


(2.51) 


L f u (n)  UK  +l(n)  \ 

{ "if”(Ri  + Riui(n))zi(n)  + “If (Ri  + r£uk  +i(n))ZK  +i(nV  (2-52) 

n 2 2 n n i 


/n  l Cen(U*+1(n))  - en(U*(n))](l+o(l) )(Z*+1(n)  - Z*(n)) 
J -i 


(2.53) 


/nv  (n)  l (u.  (n)  - u.(n))(Z*  (n)  - ZvCn)) 

1 3+1  3 3+1  3 


(.2.54) 


11 

nv2(n)  1 (Uj+1(n)U.+1(n)  “ Uj(n)Uj(n))(Z*^+1(n)  - Z*(n)) 


C2.55) 


f*  .1  [(uj+i(n)  “ uj(n))Rl  + (uj+i(n)uj+i(n)  - ujtn)Uj(.n))Rf]  x 
2 j —1 


(.2.56) 


(Zj+l(n)  " Zj(n)) 


and  V(n)  is  given  by  the  sum  of  the  following  four  expressions: 
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[en(U*(n))(l+0(l))  + u1(n)(v1(n)  + v^rOU^n))]  + 


n *-en^UK  +1(n^(1+°(1))  + UK  +1(n)(v1(n)  + v2(n)UK  +1(n))]2 
n n nT 


(2.57) 


8*  npn 


Cen(U“(n))(l+o(l))  + u1(n)(v1(n)  + v2(n)U1(n))]  x 

[u1(n)(R*  + RgU-^n))]  + 

[en^uj<  +1(n) )(l+o(l))  + +1(n)(v.(n)  + v2(n)U  (n))]  x ^ (2.58) 

n n n+ 

[uK  +1(n)(RJ  + R2Uk 

n n 


— | {[u1(n)(R*  + R*U1  (n))]2  + [uK  (n)(R*  + R*UR  +1(n))]2} 

0*  npn  n n 


(2.59) 


- Rft  Rs'i 

Ce„(U*  (n))(l+°(l))  + u.,.  (n)(v,  (n)  + + (v_(n)  + — — )U.  .(n))]| 


K " 3+1  3+1  1 9»/J  2 e|/»  3+1 

»l{  . . 

j = 1 R;»  R;; 

- [e  (U$(n))(l-to(l))  + u.(n)(v.(n)  + — — + (v„(n>  + — — )U.(n))J 
"3  3 1 e*4  2 es^r  3 

V 2 2 

(2.60) 

From  section  2.3.2,  we  know  that  c^W-'Cn)  is  asymptotically 
equivalent  to  a chi-square  random  variable  with  (Kn+1)  degrees  of 
freedom.  The  next  two  lemmas  establishthe  limiting  forms  for  Q(n)  and 
V(n).  First  note,  however,  that 


c (K  +1) 
n n 


-1/2  = "V11 

l2(k  +l):L',2 

n n 


can  he  • eplaced  by  K ‘ for  al3  asymptotic  calculations. 
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l 


■> 


t 

j 


f, 

j 

i) 

< * 


A 


1/2 

Lemma  2.4.  The  quantity  Q(n)  converges  stochastically  to  zero  as 
n_  tends  to  infinity. 

Proof . We  shall  consider  each  of  the  terms  (2.51)  through  (2.56) 
separately.  Recall  that  for  asymptotic  purposes  we  can  consider  Z*(n),..., 

Z* 

Kn+l(n)  to  be  jointly  normally  distributed  as  described  above. 

(i)  Using  (2.7),  (2.24)  and  (2.42)  - (2.44),  the  quantity  in  expression 
(2.51)  is  asymptotically  normal  with  mean  zero  and  variance  of  order 


np  L ' K' 
>i  n n 


1 s2 

1/2„W  Pn 


L 

n 


„ ,.1/2 

“P 

n n 


1/2 

Hence,  Kn  (expression  (2.51))  is  asymptotically  normal  with  mean  zero 
and  variance  of  order 


LK1/2 
n n 

nP„  ! 


-1/2  -1 

which  is  0(Kn  pn  ) as  n tends  to  infinity.  Applying  assymption 
1/2 

(2.3),  Kn  (expression  (2.51))  converges  stochastically  to  zero  as  n ■+  00 . 

(ii)  Using  assumptions  (2.7)  and  (2.9),  the  term  in  braces  in  expression 

1/2 

(2.52)  is  finite  with  probability  one.  Thus,  since  K (L  /np  ) converges 

n n 

1/2 

to  zero  as  n *»■  « as  noted  ir  (i)  above,  (expression  (2.52))  con- 

verges stochastically  to  zero  as  n increases. 


en(Uj+l(11))  ~ en(U*(n))]z 


(iii)  Let  S^(n)  denote 


36 


and  let  S2(n)  denote 


2 l l Cen^+i(n)) 

i=l  j=i+l  J 


- e (Uv(n))][e  (Uv  (n) 
n 3 n l+l 


e 

n 


(Uv(n)) • 


Then,  using  (2.25)  and  (2.26),  expression  (2.53)  is  asymptotically  normal 
with  mean  zero  and  variance  given  by 

L2 

L S (n)(l+o(l))  - ~ S0(n)(l+o(i) ) . 
n 1 n 2 

Note  that 

A2  [e'U>:2 

si(n)  * (Ln /nM{.  TTx)  dx  + °(i)) 

A"  n 


and 


S2(n)  - -Sl(„)  ♦ «PnL;V/S) 


where  the  latter  expression  follows  from  (2.43),  (2.44)  and  the  fact  that 


K 


31(n)  + S2(n)  = { l Cen(U*  1(n))  - en(U*(n 
i=l  J 


))]}i 


= {e  (U*  . (n) ) - e (U*(n))} 

n K +1  n 1 

n 


(In  the  remainder  of  this  proof,  and  in  the  proof  of  the  following  lemma, 
we  will  frequently  pass  from  sums  to  integral?  in  the  above  manner. 
Assumption  (2.41)  insures  that  the  errors  of  such  approximations  are  always 
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of  smaller  order  than  the  integrals  themselves.  The  formal  procedure  is 
carried  out  in  detail  in  appendix  C,  and  upper  bounds  on  the  errors 
involved  are  obtained. ) 

Thus,  by  (2.40),  L S.(n)  = 0(L3/2n'3/2)  and  (L2/n)S,.(n)  = 

J n 1 n n 2 

0(L3/^2n  3/'2)  as  n ■>  “.  In  particular,  (L  /n)  3//2|L  S,(n)|  and 
n n 1 n 1 1 

-3 /2  2 

(L  /n)  I (L  /n)S0(n)|  both  remain  bounded  as  n increases.  Thus, 
n 1 n 1 

1/2 

since  (Ln/n)  converges  to  zero,  (expression  (2.53))  converges 

stochastically  to  zero  as  n 
(iv)  Let  S3(n)  denote 

K 

I [u  (n)  - u (n)]2 
3=1 


and  let  S4(n)  denote 


K -1  K 
n n 


2 l l " u.(n)][u.  ,(n)  - u.(n)]. 

i=l  j=i+l  1 3 J 

Then,  in  asymptotic  probabity  calculations  we  can  assume  that  expression 
(2.54)  is  normally  distributed  with  mean  zero  and  variance  given  by 


9 L L 

nv,  (n){— ^S0(n)(l+o(l))  + — ^S.,(n)(l+o(l))}. 


1 n 3 


2 4 


Note  that 


S3(n)  = (Ln/n){  / [^^-]2g(x)dx  + o(D) 


and 


«j—iL  ii m ryr 
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S4(n)  = -Sg(n)  + [uK  +1(n)  - u1(n)]  . 

n 


Thus,  (2.19)  implies  that  S.(n)  = 0(L  /n),  S (n)  is  o(l),  and,  using  (2.42) 

o n h 


o 22  3/2  —3/2 

of  lemma  2.1,  both  L v (n)S„(n)  and  (L  /n)v, (n)S.(n)  are  0(L  ' n ). 

n 1 3 n l 4 n 


1/2 


Thus,  arguing  as  in  (ii)  above,  K (expression  (2.54))  converges 


stochastically  to  zero  as  n ->  ». 


(v)  Let  Sc(n)  denote 
b 


K 


Jituj+1(n)Uj+1(n)  - u.(n)U.(n)]" 


and  lev  S.(n)  denote 
b 


K -1  K 
n n 


2 I I [u.  (n)U. (n)  - u.(n)U.(n)][u.  (n)U  (n)  - u.(n)U.(n)]. 

i=l  j = i+l  ■*■’**■*■  1 1 j'-*-  J J 


Note  that 


S5(n> 


(Ln/n ){/  [1  + xgg^y]2g(x)dx  + o(l)> 


and 


S.(n)  = -S(.(n)  + [u  ,,(n)U  (n)  - u (n)U  (n)] 

6 5 K +1  K +1  11 

n n 


From  (2.9)  and  (2.11),  S,.(n)  is  0(Ln/n)  and  Sg(n)  is  0(1)  as  n ■*•  ». 


1/2 


Hence,  arguing  as  in  (iv)  above,  we  have  the  result  that  K (expres- 


sion (2.5C))  converges  stochastically  to  zero  as  n increases. 


(vi)  Let  Sy(n)  denote 


■mb. 


I 
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K 


l [u.  (n)  - u.(n)][Z*  (n)  - Zv)n)] 

3+1  3 3+1  3 


3=1 


and  let  Sg(n)  denote 


K 


J1t-Uj+l(n)Uj+l(n)  “ uj(n)]CZj+ifn)  " Zj(n)]- 


Then,  expression  (2.56)  can  be  written  as 


1/2 

Imbedded  in  part  (iv)  above  is  the  fact  that  Kjj  S^(n)  converges 

stochastically  to  zero  as  n “.  Similarly,  in  part  (v)  it  is  shown  that 
1/2 

Kn  Sg(n)  converges  stochastically  to  zero  as  n increases.  Hence, 

1/2 

(expression  (2.56))  converges  stochastically  to  zero  as  n -y  ». 

1/2 

Combining  (i)  through  (vi)  above,  we  have  shown  that  Q(n)  con- 

verges stochastically  to  zero  as  n tends  to  infinity.  This  completes 
the  proof  of  lemma  2.4.  QED 


Define  v*  and  v|  by 


v*  = Lim  I*yV/4v  (n) 
1 n n 1 

n-Ko 

\>2  “ v2^n^ 
n-x» 


(We  assume  here  that  the  above  limits  exist,  as  they  will  in  most 
reasonable  cases.  If  they  do  not,  due  to  an  oscillating  disturbance 
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function,  we  may  replace  v*  and  v*  by  the  appropriate  Limsup  or  Liminf 
in  lemma  2.5  below  to  obtain  upper  and  lower  non- trivial  bounds  on  the 
asymptotic  power  of  the  test.  Relation  (2.42)  of  lemma  2.1  insures  that 
the  above  quantities  remain  finite  as  n -»■  °°.) 


1/2, 


Lemma  2.5.  As  n the  quantity  Kn  V(n)  converges  stochastically  to 


the  constant  y given  by  the  sum  of  the  following  five 
S 


expressions : 


L K1/2  2 

Lim  — {[u  (n)(v  (n)  + v2(n)U  (n))]  + [uK  +1(n)(v1(n) + v2(n)UK  +1(n))]  } 

ri  n 


p 

n-*»  rn 


(2.61) 


2 2 2 
d3  + vfj  [I^j]2g(y)dy  + 2v*v*{ J [.t!Mj2yg(y)dy  + J g'(y)dy)  (2.62) 


bx  g(y) 


. /0  , /111  % e'(H  1(x) ) v, 

2V?  Li»  L1/2K1/4  / -2-i «'(S  .U))  d* 

1 ...  n n J , /„-!/ 


n-x» 


,-1/ 


(2.63) 


pn  h (H  (x))  g(6  (x)) 

n n n 


T /O  l/l,1  Pl1  1(X))  ,,p-l/  \\  1 

2V*  Lim  L1/2K1/4  / ■■  ---  -n-i g (G  / G_1(x)dx 


(2,64) 


n-x» 


Pn  hn^Hn  (x))  g(G  (x)) 


B2  B2 

v*2[l  + 2 / yg'(y)dy  + / [yg-~[^-y]2g(y)dy]. 


(2.65) 


Proof.  We  consider  each  of  the  expressions  (2.57)  through  (2.60)  separately, 
(i)  From  (2.43)  of  lemma  2.1,  the  quantity 


L K1/2  2 

--■n- — e (U*(n)) 

P„  n 1 


■i-1 
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converges  to  zero  as  n tends  to  infinity.  Further,  we  can  write 


1/2 
L YT 

— e (U*(n))u1  (n)v  (n) 
p n 1 11 


■ n 


l1/2r1/4  L1/2K1/4 

" X7§—  £„(ui(n)>  '"lir  ui(n)vi(") 


■ n 


■ n 


and  the  right  hand  expression  converges  to  zero  by  (2.43)  and  (2.39). 

In  a similar  fashion,  we  can  show  that  the  contributions  to  y of  all 


terms  in  expression  (2.57)  which  involve  en(U*(n))  are  asymptotically 


negligible.  Using  (2.44)  of  lemma  2.1,  the  same  conclusion  follows  for 


all  terms  in  (2.57)  which  involve  e (U*  ,,(n)).  Eliminating  those  terms, 

n k tx 

1/2  n 

we  see  that  (expression  (2.57))  is  equivalent  to  (2.61). 


(ii)  By  assumption  (2.9),  the  quantities  u^(n)[R*  + RgU^n)]  and 

u.,  , ( n ) E R(*  + R$UV  (n)]  are  asymptotically  normally  distributed  with 

K.  +1  x Z K +1 


n n 

mean  0 and  finite  variances.  It  follows  from  this  fact,  the  fact  implied 


by  (2.3)  that 


L K1/2 
n n 


1/2 
n p 


l1/2k1/4 
o(-^^-) 


1/2 


n 


n 


and  the  relations  (2.43)  and  (2.44)  that  the  terms  involving  en(U*(n)) 


and  e (U*  ..(n))  in  expression  (2.58)  approach  zero  stochastically  as 

n K +1 

n 1/2 
n increases.  Neglecting  those  terms,  we  can  write  (expression 


(2.58))  as 


l1/2k1/4  L1/2K1/4 

— — [u1(n)(v1(n)  + v2(n)U1(n))] -n  — [u1(n)(R*  + R^U^n))]  + 


n 


n 
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l1/2k1/4  l1/2k1/4 

“■I7r"CuK  +l(n)(vl(n)  + V2(n)UK  tl(n))]  n 1/2  [UK  +l(n)(Rl  + R2UK  +l(n))L 
n n p n n 

n *n 


Applying  (2.39),  we  see  that  the  above  expression  converges  stochastically 


to  zero  as  n -»■  00 . 


1/2 

(iii)  Using  (2.3),  (2.7),  and  (2.9),  it  is  immediate  that  K 7 (expres- 
sion (2.59))  converges  stochastically  to  zero  as  n ". 

(iv)  Let  Sg(n)  denote  the  sum  in  expression  (2.60).  Then  Sg(n)  can 
be  written  as 


(L  /n){  J [-£  J (x)]2  dx  + 0(h"1/2n"1/2)} 


n 1 dx  n 
?n 


v/here  J (x)  is  defined  as 
n 


R*  R:’: 

tn^n1^^  + g(G_1(x))[v1(n)  + + (v2(n)  + — ~-^-)G-1(x)] 

0“n  7 0|n  7 


and  the  o(.)  term  follows  from  (2.41)  as  in  Appendix  C.  Expanding 


Jn(x),  we  obtain 


4-  J 

dx  n 


^(x))  .,fi-l, 


J (X)  = + g'-(G-  "(x-))[v  rn)  + -41—  + (v  (n)  + —^-^(x)] 


h (H"X(x))  g(G-1(x) ) 1 

n n 


QA  1/2  v 2 ‘a  1/2 

02n  02n 


+ v2(n)  + 1/2  . 

0^n 
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Hence,  Sg(n)  becomes 


^■“Pn 


n e’  (H  (x) ) 


n 


(L  /n>(  / T--" ♦ <v  („>  ♦ -a-,  I y(a-W' 


p h (H  (x)) 
n n n 


6*n  pn  g(G  (x)) 


+ 2(v  (n)  + (v-Cn)  + j)  / [—  ,^x) )-]2G~:L(x)dx 


0|n  ' 4 6*n  “ pn  g(G  (x)) 


. . . R2  ,2  fPnr-g,(G"1(x))„-l/  ,-,2  , , , . R2  n2 

t (v2Cn)  + J [*--■  -j G (x)]  dx+(v2(n)+ jpg) 

0|n  7 pn  g(G~  (x))  0*n 


Ri  . 7”  g*  (g-^(x)  > 


+ 2(v,(n)  +— i^)  / 

0*n  72  pn  hn(FrX(x))  g(G  1(x)) 


n*»-  l^Py.  ( H (x))  r /p~l/ xf  \ \ 1 

. 2fu  /_>  . 3 s f n -2-IL g ----J-----  G-1(x)  dx 

( 2 e^1/2  l hn(Hn  (x))  g(G  (x)) 

2 pn 


♦ «vl(.,  ♦ 4,XV n)  ♦ -^)Vn  ax 

0«n  0«n  Pn  g(G  (x)) 


^0  0 . n n-l 


t2(v(n)+  2 ,2;  M*, 

^ 6-in1^  pn  g(G_1!x)) 


+ 2(v.(n)  + 


R*  1”Pn  e'CH^Cx))  . ..  1/2 

2 \ f n n j..  , 1 /2> 


) j dx  + o(L  n )}. 

/ ft*  / _.  \ \ “ 


2 v ' * 1/27  ; -1 

»$“  ' Pti  h (H  ■ l(x)) 


■ n n n 


Combining  terms.,  changing  variables  of  integration,  and  noting  that  by 


(2.43)  and  (2.44), 


I 
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e' (H  (x))  -1/2  -1/4 

f J2-JL dx  = o(L  1/2K  1/4) 

p h n(H"1(x)) 

n n n 


we  obtain 


S9(n) 


L A2  Ce'(y)]2  R*  *2  ,()2 

r'/j  + (V"> + ^i72)^  [£iw]  s(y)dy 


R1  R|  ( } 2 

+ 2(Vn)  + — T72)(v2(n)  + ZHl72)(1  yg(y)dy  + 
e2n  e2n  B1 


/ g'(y)dy) 
B, 


, , . R1  ,XfPn  £A(Hn1(x))  g*(6-1(x)) 

+ 2(v  (n)  + r-rr)  J ri li 

1 0*niy  p h (H _1(x))  g(G  "(x)) 


■ n n n 


R*  1_Pn  e'Crtx))  g' (G_1(x) ) 

+ 2(v  (n)  + yjj)  J — t n G (x)dx 

2 6*n1/2  p h (H  X(x))  g(G  (x)) 

2 n n n 

R*  B2  B2 

+ (v2(n)  + - ^I72)2[1  + 2^  ys'(y)dy  + / ty^y]2g(y)dy] 

e2n  B1  B1 


. -1/2  -1/2., 

+ o(L  n )}. 
n 


Now  let  y denote  the  quantity 


2 2 
d3  + CI44]2g(y)dy  + 2v*v*(J  CI^y32yg(y)dy  + J g'(y)dy) 
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1/0  l/41_,Pn  6 (H^x))  g'^3"1^)) 

+ 2v*Lim  L1/2K1/4  / -^LJL _ dx 

n-*»  n n Pn  ^n^Hn  (*))  g(G  (*)) 


. 1/u1_Pn  e'Of^x))  g'  (G_1(x) ) 

+ 2v*Lim  Lj/2Kj/4  / -iL-Ji. - G^xjdx 


n-*» 


Pn  hn(H^1(x))  g(G_1(x)) 


B2  B2 

+ v|2[i  + 2 / yg'(y)  + / [ygg^y]2g(y)dy]. 


n 

j' 

S; 

l 


} 

X 

t 


I 


* t 


Then,  since  (expression  (2.60)  = nSg(n),  we  can  use  (2.1)  and  (2.4)  to 
1/2 

argue  that  n{products  involving  R*  and  R*  in  Sg(n)}  converges 
stochastically  to  zero,  and  hence,  for  any  e>0. 


Lim  P{ | K1^2( expression  (2.60))  - y | > e } = 0 

n-xo  *J 


Combining  (i)  through  (iv)  above  completes  the  proof  of  lemma  2.5.  QED 
The  asymptotic  power  of  the  test  (2.31)  now  follows  from  lemmas  2.3, 
2.4,  and  2.5,  and  is  given  in  the  following  theorem: 

Theorem  2.1.  The  asymptotic  power  of  the  test  (2.31)  is  given  by 


+ c.  ) 
4>,a 


(2.66) 


as  n 

Proof.  First  note  that 


46 


~2  = 0*2[0*2  + R* / /n  + v2(n)]  2 


R*//n  + v.(n) 

= [i  + - — — a"2 
02 


and  the  latter  expression  converges  stochastically  to  one  as  n ->•  ».  From 

A A 

lemma  2.3,  we  can  therefore  write  the  test  statistic  T(n;0l502)  as 

R*//n  + v (n)  0 c W*(n)  - (K  +1) 

[1  + ]~2C-n  ■ ] (1  + o(l))  + 

02  /2(K  +1) 

n 


R*^  + v2(n)  _2  0(  } R*//n  + v (n)  _2 

C1  + -f- ] (1+0(1))  + [i  + -1— —1  ] 101 

e2  v2  Q‘A  /2 


Using  lemmas  2.4  and  2.5,  we  have 


T(n;01,02) 


c W*(n)  - (K  +1) 


'2(K  +1) 
n 


— * * o„<i> 


as  n -»■  09 . Recalling  that  cnW*(n)  is  asymptotically  equivalent  to  a chi- 
square  random  variable  with  (Kn+1)  degrees  of  freedom  completes  the  proof 
of  theorem  2.1.  QED 


2.4.1  Remarks 


(1)  In  theory,  the  asymptotic  power  of  the  test  immediately  obtainable 
from  tables  of  <Kx).  The  quantity  Y , however,  appears  quite  formidable 
to  compute,  particularly  because  of  the  terms  (2.63)  and  (2.64).  Since 
G(x)  is  usually  easier  to  work  with  than  Hn(x),  the  following  relation 
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often  helps  simplify  the  computation: 


e'Crtx))  e*e*(0*  + 6*G_1(x)) 
n n 2 n 1 2 


h (H  1(x)) 
n n 


g(G_1(x)) 


(1+^(1))  for*  p < x < 1-p  . (2.67) 

n rn 


(2)  None  of  the  conditions  imposed  on  the  distrubance  function  en(x) 


specifically  required  that  H (x)  lie  outside  the  region  encompassed  by 


the  null  hypothesis.  In  particular,  suppose  we  take 


x - 6 


(n) 


»„<*> 


* S( 


,(n) 


where  0^  •>  0*  and  as  n ■>  ®.  For  our  definition  of 


alternatives  to  be  consistent,  we  should  obtain  a power  equal  to  a for 


the  a- level  test  (2.31)  when  {H  (x)}  is  the  underlying  sequence  of  dis- 


tributions. 

First  suppose  that  the  assumptions  of  section  2.3  are  satisfied,  i.e. 


x-e 


(n) 


X"  0 ** 

(e  (x)  = G( — ~V'  \~)  - G( — k~)} 
n In) 

02  02 


satisfies  (2.38)  through  (2.41).  Then  it  follows  that  the  constant  Y 


S 


equals  zero  and  the  power  is  given  by  9(c.  ) = a as  desired.  To  show 

9 ,0t 


this,  we  note  that  for  this  case  we  have  Hn^(t)  = i O^^G  \t),  ar.d 


form  (2.45"), 

e (H_1(t))  = - g(-~  -(---^)-£(.e(ri)-6>0  + C0^n)-0*)G_1(t)]  + o(if1/2K'1/4). 
nn  ...  1122  nn 


yA. 


-- 


! 1 


— miina aaiatafaft  uaaietSim  l-i  r— jeaaa ti 


MkMCI 
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Differentiating  the  above  expression  with  respect  to  t yields 


= . c<s:^.n  . + 


eln)-e* 


h(H  -"(t)) 
n n 


2 g(G  A(t)) 


g(6  ‘(t)) 


Also, 


and 


G (H~1(l/2) ) ejn)-0*  1/0  . 

f \ n n 1 1 f T ~1/ 2«>“l/4  v 

v.(n)  = -7 7TT = — + o(L  K ) 

1 g(0)  0»  n n 


0(n)  0* 

, * °2  °2  , /r-l/2(,-i/4s 

v2  n = 5|  * ° Ln  Kn  > 


so  that 


fl(n)  a * 
1/21/4,1  _°1 


v*  = Lim  h K 7 (— ~r:.r — ■)  and  v*  = Lim  C ) 


o(^)  a,*; 

* = T.im  Tl/2„l/4,_2  __~B2. 


1 n n 0„ 

n-*»  2 


2 n n ' Q% 

n-x°  2 


In  addition. 


e (U*(n))  = e (H~1(p  ))  = - u, (n)[v  (n)  + v„(n)U  (n)]  + o(L  1/2K  X/%  ), 
nl  nnn  11  2 l nnn 


and  similarly. 


en<C(J-Pn»  =-  \ U(n,tVn)  + V")UK  ♦!<”«  + 

n n 


Finally,  note  that  in  this  case  the  assumption  (2.36)  becomes 


e;n)-0?  b2 

0 = — 55— =■  / g'(y)dy  + 
2 Bl 


v(n) 

2 


* fZ' (y)dy  + 1], 
B, 
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so  that 


2 2 

v* / g'(y)dy  + v*[J  gy'(y)dy  + 1] 


Using  the  above  re.la  \ions  in  the  expressions  (2.61)  - (2.65)  yields 


immediately  the  desired  result  that  y^  = 0. 


For  the  sequence  (Hn(x)}  as  defined  here,  it  is  of  course  not 


necessary  that  the  assumptions  of  section  2.3  be  satisfied  in  order  that 


the  test  (2.31)  have  power  equal  to  a.  In  fact,  we  have 


u.  (n) 

r 


nj(n;01,02)  = v^-^-CU^(n)  - 0]L  - 0^1 


u.(n) 


v^-4ir_{C9in)‘i'e2n),J-(n);i“C0'n)  ]_C02n)  + -2-3U.(il) 

92  1 2 3 2 3 


u.(n) 

-is— (R'i + r5>* 


1/2  A A —1  A A 

and  it  is  easily  shown  that  1C'  n'  (nj0^,02^n  rl^n’0i’02^  converges  to 


zero  in  probability  as  r.  » regardless  of  the  rate  at  which  0^  and 


©2  converge  to  0'£  and  0*. 


(3)  In  the  preceding  sections  we  defined  alternatives  relative  to  a 


predetermined  procedure  in  order  to  achieve  a power  level  between  a and  1. 


In  particular,  the  functions  and  Kn  were  first  specified  and  then 


the  conditions  (2.38)  - (2.41)  on  the  alternatives  were  imposed.  In 


practice,  however,  the  true  underlying  distribution  is  unknown  but  fixed. 


and  it  is  L and  K which  must  be  chosen.  To  this  end,  the  form  of 
n n 


the  power  function  indicates  that  choosing  as  large  as  possible  would 


.T*±3».  i**,. 
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lead  to  the  greatest  power  for  fixed  (large)  n when  the  underlying  dis- 
tribution belongs  to  the  alternative  hypothesis.  The  quantity  y can 

S 

1/2—  — 

be  written  as  Lim  L K y (n),  where  y (n)  is  some  nonnegative  function 


n-Ho 


n n 'g 


g 


of  the  true  distribution  as  well  as  the  null  hypothesis.  Suppose  we  take 

= MLn,Kn=  (l/M)Kn  for  any  positive  constant  M greater  than  one.  Then 

it  is  easy  to  see  that  assumptions  (2.1)  through  (2.6)  still  hold  (i.e. 

L and  K are  feasible),  but  now  y has  been  increased  to  y*  = My  , 
n n g g g 

with  a corresponding  increase  in  the  asymptotic  power  of  the  test. 

There  are,  however,  additional  considerations  in  choosing  L^.  The 
interpretation  of  Ln  is  that  of  the  "spacing"  between  the  successive 

A A 

sample  quantiles  which  are  used  in  computing  the  test  statistic  Ttn;©^^) 
(here  we  mean  "spacing"  in  terms  of  the  indices  of  the  order  statistics, 
not  the  values  which  they  assume).  The  larger  is  made  (for  fixed  n), 

the  fewer  the  number  (K^+l)  of  sample  quantiles  which  are  used.  For 
"moderate"  n,  therefore,  taking  Ln  large  (say  n , where  6 is  close 
to  1)  may  result  in  an  inferior  test  since  very  few  quantiles  will  actually 
contribute  to  the  test  statistic.  Evaluation  of  this  conjecture  would 
require  extensive  empirical  studies  and  is  not  done  here. 

We  did,  however,  use  Monte  Carlo  methods  to  study  T(n;0^,02)  and 


A A 


T(n;6^,02)  under  the  null  hypothesis.  The  purpose  was  to  determine  the 
extent  to  which  the  distributions  of  whose  statiscics  resembled  normal 
distributions  for  small  to  moderate  n and  whether  or  not  (a)  the  choice 
of  Ln  or  (b)  the  use  of  parameter  estimates  affected  the  degree  of 
approximation  to  the  asymptotic  distribution  theory.  The  methods  and  re- 
sults are  presented  in  Appendix  A.  We  concluded  from  the  study  that  the 
asymptotic  distribution  theory  is  not  adequate  for  determining  the  criti- 
cal regions  of  the  test  (2.31)  for  moderate  n (at  least  n4300  and 
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probably  much  larger).  Deviations  from  normality  were  particularly  evi- 
dent in  the  tails  of  the  empirical  distributions  of  TCn;©^,©,,)  and 

A A 

T(n;0^,02),  although  those  deviations  may  have  been  amplified  by  the 
limited  number  of  replications  in  our  study.  Not  surprisingly,  the 
distributions  of  the  statistics  exhibited  greater  non-normal  behavior 
when  the  parameters  0^  and  02  were  estimated  from  the  sample  than 
when  they  were  known  and  specified.  The  effect  of  the  choice  of 
on  the  distributions  or  rate  of  their  approach  to  normality  was  not  clear. 

2.5  Examples 

2.5.1  Obtaining  the  "Standard  Representative"  of  a given  Scale-Location 
Parameter  Family 

Throughout  this  chapter  we  have  assumed  tnat  the  distribution  G(x) 
satisfied  G 1(l/2)  = 0 and  G-1(3/4)  - G-1(l/4)  = 1.  As  noted  previously, 

A A 

this  assumption  allowed  us  to  use  the  same  estimators  0-^Cn)  and  02(n) 
without  our  having  to  carry  along  in  our  calculations  certain  constants 
depending  upon  the  particular  G(x)  appearing  in  HQ.  The  disadvantage 
of  this  representation  is  that  the  parameters  0.^  and  02  are  not  the 
"usual"  location  and  scale  parameters.  In  this  subsection  we  derive  the 
equations  which  must  be  solved  to  determine  0^,02  an<^  ®(x)  given  the 
more  common  form  of  the  distribution  and  solve  them  for  many  useful  cases. 

Let  F(x)  = G((x-0jl/02)  be  given.  The  relation  G 1(l/2)  = 0 implies 
that  1/2  = G(0)  = F^).  Now,  let  xx  = G-1(l/4)  and  x2  = G_1(3/4). 

Then,  1/4  = G(x^)  = F(02x^+0^),  and  3/4  = G(x2)  = F(02x2+0^).  Since 
x2  - x^  = 1,  we  have  the  following  system  of  equations  which  can  be  solved 
for  an<^  xi: 
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F(01) 

= 1/2 

F(02X1+01> 

= 1/4 

(2.68) 

F(02(i+x1)+e1) 

= 3/4 

Once  0^  and 
suppose  we  are 

9 2 are  found,  G(x)  = F^xt0^) 
interested  in  the  normal  family 

is  determined.  For  example, 
F(x)  = $(  (x-u)/cf)  . Then 

the  equations  (2.68)  become 


0,-11 
U \ ) 

= 1/2 

8 x te 

H 0 ) 

= 1/4 

02(l+x1)+81-y 
$(  0 ) 

= 3/4 

The  first  equation  above  yields  01  = p.  The  second  equation  then  gives 
02xl /°  = -,6745;  the  third  equation  gives  02(l+x1)/a  = .6745,  and  together 
they  imply  02  - 1.349.  Thence,  G(x)  = 4>(1.349x)  and  g(x)  = 1. 349<J)(1. 349x) . 
Table  2.2  lists  the  results  of  the  above  transformation  applied  to  five 
common  distributions. 

2.5.2  Weibull  Alternatives 

A common  way  to  study  the  asymptotic  power  of  a test  is  to  choose  a 
parametric  family  of  alternatives  and  examine  the  behavior  of  the  test  as 
one  or  more  of  the  parameters  approach  limiting  values.  Since  we  are 
dealing  with  a composite  hypothesis  which  includes  an  entire  scale-location 
parameter  family,  we  must  go  to  a three  parameter  family  in  order  to  study 


STANDARD  REPRESENTATIVES  OF  SCALE-LOCATION  PARAMETER  FAMILIES 


Logistic  [1  + exp{-(x-a)/g}]  [1  + (1/9)X]  a gin 
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the  asymptotic  power  of  the  test  (2.31)  using  this  approach.  A natural 
family  is  the  Weibull  family,  which  approaches  the  exponential  family  as 
the  "shape"  parameter  converges  to  one.  This  family  has  proved  to  be 
particularly  well  suited  to  the  metric  arising  in  tests  based  on  sample 
spacings  (Blumenthal  [2]);  however,  the  computations  for  the  test  (2.31) 
are  considerably  more  tedious. 

We  define  the  sequence  of  alternatives 


H (x) 
n 


0 


if  x > 0 
otherwise 


where  3^  is  the  "shape"  parameter,  and  where,  without  loss  of  generality 
we  have  taken  the  location  and  scale  parameters  to  be  0 and  1 respectively 
Then 


and 


S(K) 


hn(x) 


6nx  e 


(x  > 0) 


(x  > 0). 


1/2  1/4 

We  first  consider  the  assumption  (2.40).  Defining  a(n)  as  , 

we  will  ow  that 


B = 1 - r for  some  r,  0 < r < «° 

n a(n) 


satisfies  the  assumption,  with  the  value  of  r to  be  determined  below. 


Consider  the  function 
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S(x,n) 

Since 


[e'(x)]2  . . . ..  . ^n  v 

— 2 = h (x)  - 2°~X  + Cl  - lx"(r/a(n))e(x  _x)e"x 

hn(x)  VXJ  + U a(n)JX  8 8 ' 


Jx-(i./a(n))  -x  . r(1  . r , 

^ a(n) 

is  finite  for  r < a(r. ),  S(x,n)  is  clearly  integi  able  for  all  r such 
that  a(n)  > r.  Furthermore,  S(x,n)  is  easily  shown  to  be  strictly  de- 
creasing in  n,  and  we  show  below  that 

Lim  a2(n)S(x,n)  = 2r2(x-l)2(log  x)2e~x  (2.69) 

n-*» 


with  (see  [14]) 

03  2 

/(x-l)2(log  x)2e  xdx  = y2  - 2y  + 2 + 
o 

(y  = .5772157  is  Euler's  constant).  Hence,  we  may  use  the  dominated 
convergence  theorem  to  write 

CO  00 

0 0 O 

Lim  a (n)/s(x,n)dx  = /Lim  a (n)S(x,n)dx  = (5.6474)r 
n 0 0**° 

and  choosing  r = .4208dg  satisfies  (2.40). 

To  prove  (2.69),  we  note  that  for  fixed  x we  have 

-xS"  . -Cx  - iTSxl°s  x + o(ifn7):i 

e - e 

= e-xeWa(n))xlogxa+oa)) 
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as  n ■+  ».  Hence,  neglecting  smaller  order  terms, 


a2(n)S(x,n)  = a2(n)e-’‘[x(l’/a(n))<x-1)  + x(r/a(n) )<l-x>  _ 2L 


In  order  to  determine  the  limit  of  the  above,  we  treat  n as  a continuous 
variable  and  denote  - by  a'(n).  Using  the  fact -that 


d (r/a(n))(l-x)  _ r 


= 5 a' (n)(x-l)(log  x)x 

a^(n) 


(r/a(n))(l-x) 


we  apply  L'Hopital's  rule  twice  and  obtain 


a2(n)S(x,n)  = L^m 

a (n) 


rx(l0ex)[(x-l)x(r/a(n))(’1-1)t(l-x)x(l,/a(n))(1-x)] 


a-1(n) 


7.  2 -x,.  -,2r , . .2  (r/a(n))(x-l)  , 

L^m  r e (log  x)  L(x-l)  x + 


(l-x)2x(r/a(n))(1-’!)3 


= 2r2(x-l)2(log  x)2e  x 


as  desired. 


For  this  example,  however,  it  is  assumption  (2.39),  not  (2.40), 
which  actually  determines  the  power  of  the  test.  In  particular,  we  will 
now  show  that  (2.39)  requires 


B = 1 - 





where  a(n)  = a(n)/p  and  r is  positive  and  finite. 

n 

Using  the  "standard  representative"  G(x)  = 1 - ( 1/2 ( 1/3 ) from 
Table  2.2,  tedious  calculations  yield  the  quantities  appearing  in  defini- 
tion (2.38).  These  expressions,  neglecting  smaller  order  terms,  are 
summarized  in  Table  2.3. 


TABLE  2.3 


EXPRESSIONS  REQUIRED  FOR  WEIBULL  POWER  COMPUTATION 


Expression 


G_1(x) 


log  (2(l-x)) 


log  3 


g(G  (x))  (l-x)log  3 


g(G  1(x))G  1(x)  (x-l)log(2(l-x)) 


log  2 


Expression 


H~1(x)  (-log(l-x))1/3n 


e (H  (x))  r /.  1 .2.  . 1 

n n = (log  — ) loglogp^ 

a(n) 


vx(n) 


v2(n) 


_r (log  2)  loglog 

loS  3 


a(n)  (log  4/3  loglog  4/3 


3 log  3 
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Since  v*  = Lim  a(n)v, (n)  and  v*  = Lim  a(n)v^(n)  are  zero,  it  is  easily 
1 n-x»  1 2 n-*»  2 * J 

seen  that  the  only  contribution  to  the  quantity  y comes  from  expression 

§ 


(2.61).  Specifically, 


(1-p  )log  3 log(2(l- 

{_ 2 [d  - d r— -5 

a(n)log  2 g 


log(2(l-p  ) ) 0 

n i z 

"«  r\  J 


y = Lim  a (n)  p log  3 log(2p  ) „ 

g + 


rlog  3 j j "i  2 

[lSF2  dl  ' d2] 


3.4212  r2 . 


and  the  asymptotic  power  of  the  test  (2.31)  is  given  by 


<Kc.  + 2.419  r ). 

$,a 


2.5.3  Normal  Contamination  Alternatives 


We  consider  the  sequence  of  alternatives 


H (x)  = (1-6  H(x)  + 6 $(x+p  ) 

n n n ^n 


where  the  sequence  {6^}  converges  to  zero  as  n 00 . These  alternatives 
provide  a basis  of  comparison  of  the  test  (2.31)  with  tests  designed 
specifically  for  the  normal  hypothesis  which  we  discuss  in  the  next 
chapter.  Using  Taylor  series  expansions  (see  example  3.1),  we  have 


’rnmumim* 


en(x)  = 6n($(x+pn)  - 4>(x)} 


= 6nPn^  - (Pn/2)x  + (p2/6)(x2-l)H(x)  + 0(5nP^) 


and 


en'(x)  = 5nPni~x  + (pn/2)(x2-l)}<{>(x)  + o(6np2), 


1/2  -1/2  -1/4  -1 

It  is  easily  shown  that  condition  (2.40)  requires  6 = dQ  L K p , 

n o n n n 


condition  (2.39)  is  satisfied  with  = ^2  = °9  and  (2,4D  is  satisfied 


for  b(n)  = (nLnPn)  . Considering  the  quantities  required  for  the  power 


computation,  v. (n)  = 6 p (l+o(l))  and  = o(6  p ) : so  that  v*  = dl 
1 n n 2 n*n  1 3 

and  V"  = 0.  Thus,  in  addition  to  (2.61),  (2.64)  and  (2.65)  make  no 


1/2 


contribution  to  y for  this  example.  Also,  using  (2.67)  and  taking 
© 


G(x)  from  Table  2.2, 


1_pn  e^(H~1(x))  g'(G"1(x)) 

J 1 dx 


1-P. 


n 


(1.349)*6  p J [1  - (P  /2)<t  1(x)]-I>  1(x)dx 
n n 1 n 

pn 


pn  g(G_1(x) ) 


+ °(5  PJ 
n n 


= (1/2)(1.349)26  p2  + o(6  p2) 
n n n n 


from  which  it  follows  that  (2.63)  vanishes.-  Thus 


y = d,  + (v*)2  f (1.349)V<j>(1.349x)dx 
S -j  J- 


5 2 . 


= dg(l  + (1.349 j) 


and  the  asymptotic  power  of  the  test  (2.31)  is  given  by 


*(c*,a  + 1*"d3)- 


igjgrgv-gsg- 


: - ' A 


1 
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CHAPTER  III 


LARGE-SAMPLE  TESTS  FOR  NORMALITY 


3 . 0 Introduction 

In  both  theoretical  and  applied  statistics,  the  most  common  distri- 
butional assumption  which  is  made  is  that  of  normality,  and  a great  deal 
of  research  has  been  directed  toward  testing  the  validity  of  that  assump- 
tion. In  this  chapter,  we  develop  a test  based  on  a gradually  increasing 
number  of  order  statistics  which  is  a large-sample  analog  of  the  Wilk- 
Shapiro  test  for  normality. 

In  section  3.1,  we  survey  the  literature  concerning  the  Wilk-Shapiro 
and  related  tests. 

In  section  3.2,  we  describe  the  statistic  ?/(n).  Using  the  asymptotic 
distribution  theory  discussed  in  chapter  2,  we  determine  the  asymptotic 

f\j 

distribution  of  W(n)  under  the  null  hypothesis  and  use  that  distribution 
to  define  our  test  for  normality. 

In  section  3.3,  we  prove  that  the  W-test  is  consistent. 

In  section  3.4,  we  determine  the  manner  and  rate  of  approach  to  the 
null  hypothesis  that  a sequence  of  contiguous  alternatives  must  satisfy 
in  order  to  yield  a non-trivial  power  as  the  sample  size  n tends  to 
infinity. 

In  section  3.5,  we  use  the  results  of  the  previous  sections  to  define 
and  analyze  three  additional  statistics  which  give  rise  to  improved  tests 
for  normality. 

In  section  3.6,  we  give  examples  of  the  behavior  of  these  tests  under 
specific  sequences  of  alternatives. 

In  section  3.7,  we  discuss  the  foregoing  results  as  well  as  possible 
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CHAPTER  III 


LARGE-SAMPLE  TESTS  FOR  NORMALITY 


3.0  Introduction 

In  both  theoretical  and  applied  statistics,  the  most  common  distri- 
butional assumption  which  is  made  is  that  of  normality,  and  a great  deal 
of  research  has  been  directed  toward  testing  the  validity  of  that  assump- 
tion. In  this  chapter,  we  develop  a test  based  on  a gradually  increasing 
number  of  order  statistics  which  is  a large-sample  analog  of  the  Wilk- 
Shapiro  test  for  normality. 

In  section  3.1,  we  survey  the  literature  concerning  the  Wilk-Shapiro 
and  related  tests. 

In  section  3.2,  we  describe  the  statistic  ?f(n).  Using  the  asymptotic 
distribution  theory  discussed  in  chapter  2,  we  determine  the  asymptotic 
distribution  of  $(n)  under  the  null  hypothesis  and  use  that  distribution 
to  define  our  test  for  normality. 

In  section  3.3,  we  prove  that  the  W-test  is  consistent. 

In  section  3.4,  we  determine  the  manner  and  rate  of  approach  to  the 
null  hypothesis  that  a sequence  of  contiguous  alternatives  must  satisfy 
in  order  to  yield  a non-trivial  power  as  the  sample  size  n tends  to 
infinity. 

In  section  3.5,  we  use  the  results  of  the  previous  sections  to  define 
and  analyze  three  additional  statistics  which  give  rise  to  improved  tests 
for  normality. 

In  section  3.6,  we  give  examples  of  the  behavior  of  these  tests  under 
specific  sequences  of  alternati”"  . 

In  section  3.7,  we  discuss  the  foregoing  results  as  well  as  possible 
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areas  of  future  research. 


3.1  Goodness  of  Fit  Tests  for  Normality 


Since  the  appearance  of  the  Wilk-Shapiro  test  eleven  years  ago, 
considerable  attention  has  been  devoted  to  testing  the  compound  hypothesis 
that  a sample  of  data  comes  from  a population  whose  underlying  c.d.f.  is 
an  unspecified  member  of  the  normal  family.  The  primary  focus  of  such 
research  has  been  to  understand  and/or  improve  upon  the  original  W-test. 

We  begin  by  discussing  this  test,  follow  that  with  a discussion  of  several 
of  the  related  tests  which  were  subsequently  proposed,  and  throughout, 
give  a brief  survey  of  the  empirical  studies  which  have  provided  the  fuel 
for  research  in  this  area. 

3.1.1  The  Wilk-Shapiro  W-test 

The  Wilk-Shapiro  test  was  developed  as  a means  of  formalizing  a 
rather  informal  procedure  for  testing  fit — the  probability  plot.  Briefly, 
a probability  plot  is  a regression  of  the  ordered  observations  on  the 
expected  values  of  the  order  statistics  from  a standardized  version  of 
the  hypothesized  distribution. 

Suppose  X^,  Xg,...,  is  a random  sample  with  common  distribution 
function  G(x)  and  denote  the  ordered  sample  values  by  <_  4 • • • 4 

Also  suppose  that  the  null  hypothesis  is  true,  i.e.  G(x)  = $(— — )•  Then 
we  can  write 


Y.  = p + aZ. 

l l 


i — 1,2, ... ,n 


where  Z^,  Z^,...,  Z^  are  the  order  statistics  of  a sample  from  a standard 
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normal  population.  Shapiro  and  Wilk  [27]  used  the  square  of  the  generalized 
least  squares  estimator  of  a obtained  from  the  above  linear  model  (i.e. 
the  square  of  the  slope  of  the  regression  line),  divided  that  by  the  usual 
symmetric  sum  of  squares  about  the  mean,  standardized  the  resulting 
statistic,  and  obtained 


W = m,V“1V"1m 

l (Y.  - Y)2 
i=l 

where 


m.  = E[Z. ] 

l i 


m'  = (m1,...,mn) 


V. . = cov(Z..Z.) 
13  i 3 


V = [V..]. 
13 


Some  elementary  facts  about  W are  that  it  lies  between  0 and  1 and  is 

location  and  scale  invariant.  Under  the  null  hypothesis,  the  numerator 

and  denominator  of  the  above  statistic  are,  up  to  a constant,  estimating 

2 

the  same  quantity,  a . The  distribution  of  W under  the  null  hypothesis 
was  computed  empirically  using  Monte  Carlo  methods  and  tabulated  for 
n = 3(1)50.  Under  the  alternative  hypothesis,  Shapiro  and  Wilk  found, 
again  empirically,  that  the  values  of  the  statistic  tended  to  be  smaller 
than  under  the  null  hypothesis.  No  analytical  results  have  been  obtained, 
either  under  HQ  or  H^,  for  finite  n or  asymptotically  as  n tends 
to  infinity.  But  in  empirical  studies,  the  W-test,  which  rejects  HQ 
for  small  values  of  W,  has  been  shown  to  be  at  least  as  good  and  generally 
better  than  most  of  the  commonly  used  tests  over  various  symmetric. 
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asymmetric,  short-  and  long-tailed  alternatives. 

By  far  the  most  extensive  of  these  empirical  power  studies  was  reported 
by  Shapiro,  Wilk,  and  Chen  [29],  In  that  study,  they  compared  W with 
eight  other  tests,  including  the  Kolmogorov- Smirnov,  Cramer-von  Mises, 
and  Anderson-Darling,  over  twelve  families  of  alternative  distributions. 

They  found  the  W-test  to  be  a more  sensitive  indicator  of  non-norm-: ’.ity 
than  any  of  the  other  tests;  in  addition  to  exhibiting  better  power 
against  most  alternatives,  it  was  the  only  test  which  never  had  a low 
power  when  another  test  had  a high  power  against  a continuous  alternative 
distribution.  Stephens  [30]  later  amended  their  results  in  comparing  W 
to  the  various  EDF  tests  mentioned  above.  He  pointed  out  that  the  study 
[29]  computed  the  EDF  statistics  by  treating  the  means  and  variances  of 
the  alternative  distributions  as  if  they  were  completely  specified  rather 
than  using  estimates  btained  from  the  samples.  Stephens  still  found, 
however,  that  the  W-test  was  generally  superior  to  the  best  EDF  competitor, 
though  usually  only  slightly.  Chen  [3]  studied  the  behavior  of  W under 
scale  and  location  contaminated  normal  distributions,  finding  the  W-test 
to  be  sensitive  to  both  types  of  deviations  from  normality,  although  the 
power  dropped  off  rapidly  as  the  contaminating  distributions  approached 
the  normal. 

The  primary  drawbacks  to  the  use  of  the  W-test  are:  1)  the  coeffi- 

cients cannot  be  computed,  but  must  be  obtained  from  tables,  a different 
set  of  coefficients  for  each  different  value  of  n;  and  2)  the  coefficients 
(in  particular  the  v^'s)  are  only  known  exactly  for  n < 20.  (Shapiro 
and  Wil.c  used  linear  regression  to  extrapolate  the  coefficient  values  up 
to  n = 50. ) 
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In  a later  paper  [28],  Shapiro  and  Wilk  found  good  empirical  agree- 
ment between  the  distribution  of  W and  a member  of  Johnson's  Sg  class 
of  distributions. 

3.1.2  Related  Tests 

The  nonavailability  of  the  ' s for  n >_  20  prompted  Shapiro 
and  Francia  [26]  to  propose  the  following  statistic  as  an  approximation 
to  W: 


W* 


(m'Y)2 

m'm 

n _ o 

l (Y.  - Y)2 


They  justified  the  use  of  W*  on  the  grounds  that,  for  large  n,  the 

order  statistics  { Y^}  could  be  treated  as  if  they  were  independent. 

This,  of  course,  is  false,  but  Stephens  [31]  later  showed  that  in  the 

normal  case  the  vector  of  mean  values  of  the  standardized  order  statistics 

is  an  asymptotic  eigenvector  of  the  covariance  matrix  of  those  statistics. 

Thus,  the  statistic  W*  is  in  fact  asymptotically  related  to  W,  and 

has  the  advantage  that  the  vector  m is  known  up  to  n - 400.  Shapiro 

and  Francia  have  computed  the  critical  region  for  the  W‘*-test , which 

rejects  for  small  values  of  W*,  for  n = 35,50(1)99.  Again,  no  analy 

tical  results  are  available,  but  in  empirical  studies  [26,30,35],  the 

y:'c_-test  has  been  shown  to  be  very  similar,  but,  in  general,  slightly 

inferior  in  performance  to  the  W-test,  although  Shapiro  and  Francia  found 

the  W*-test  to  be  more  sensitive  to  "near  normal"  alternatives.  Sarkadi 

* 

[22]  proved  the  consistency  of  the  W*-test  against  alternatives  wi.h 


finite  second  moments. 
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The  most  extensive  analytical  work  in  this  area  has  been  done  by 
De  Wet  and  Venter  [9,10],  who  were  concerned  with  the  statistic 

(m'Y)2 

2 m'm 

r = 

n n 0 

l (Y.  - Y)2 

i=l 

where  m1  = (in, , . . . ,in  ) with  in.  = 4>  1(— 1 They  determined  the  asymp- 
1 * n x n+1  J 

2 

totic  distribution  of  n(l  - r ) - a under  the  null  hypothesis  to  be 
T(y),  where  T(y)  is  the  distribution  of 


where  Xg,  X^, . . . are  i.i.d.  standard  normal  random  variables,  and  where 
an  is  a complicated  function  which  is  o(log  n)  as  n tends  to  infinity. 
Their  methods  are  discussed  in  more  detail  in  Appendix  B.  The  percentiles 
of  T(y)  were  computed  numerically  using  the  inverse  of  the  characteris- 
tic function.  As  we  might  expect  since  the  nu  are  limiting  values  of 
the  nu,  good  agreement  was  found  between  rn  and  W*  (for  n = 99);  it 
would  b-  difficult,  however,  to  make  a rigorous  statement  concerning 

2 

the  rex.  onship  between  the  two  statistics.  Further  remarks  about  rn 
and  W*  as  well  as  additional  empirical  approximations  to  the  distribu- 
tions of  W and  W*  appear  in  Stephens  [33]. 

In  other  work  building  on  his  earlier  results  on  covariance  matrices, 
Stephens  [32]  considered  the  more  general  linear  model 


T 
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E(Y.)  = p+  am.  + 3„w0(m.)  + 6qW.(m. ) + . 

1 1 Z Z 1 001 


where  are  constants  and  w0(m.),  w0(m.),...  are  functions 

of  the  itk,  in  particular,  the  Hermite  polynomials.  Under  the  null  hy- 
pothesis, 0 = $2  = ^3  = * ' * 5 an<^  a *arSe-samP^-e  tes’t  ’t^ie  latter 
event  is  proposed  which  may  be  extended  to  as  many  3^  as  the  experi- 
menter wishes.  Specifically,  by  choosing  W2(.),  w^C.),...  as  he  did, 

A A 

Stephens  was  able  to  obtain  simple  expressions  for  the  estimates  B2,B3,. 
and,  under  the  null  hypothesis,  show  that  standardized,  studentized  ver- 


sions of  these  estimates  are  asymptotically  independent  and  normally 

:2  :2 


distributed.  Thus,  the  sum  of  the  square  estimates,  B2  + g3  + ...  has 
a chi-square  distribution  with  the  number  of  degrees  of  freedom  deter- 
mined by  the  number  of  estimators  which  the  experimentor  wishes  to 
consider.  Stephens’  test  is  based  on  the  latter  distribution. 

Finally,  D'Agostino  [5]  considered  tests  based  on  the  following 
statistic: 


I Ci  - (l/2)(n+l)]Y. 
i=l  1 


v\2-il/2 


[n  l (Y.  - Y )"] 
i=l  1 


D'Agostino  showed  that  under  the  null  hypothesis,  D is  asymptotically 

Pi 


normally  distributed,  but  he  found  that  n must  be  well  over  1000  before 
the  asymptotic  distribution  is  useful  in  determining  the  critical  regions 
of  his  test.  Instead,  he  used  Cornish-Fisher  expansions  (based  on  the 


first  four  cumulants  of  D^)  to  determine  the  percentiles  of  the  dis- 


tribution of  a standardized  version  of  D^.  In  an  empirical  power  study, 


he  found  that  a two-sided  test  based  on  D^  was  abxe  to  detect  deviations 


from  normality  due  to  both  skewness  and  kurtosis  (i.e.  the  test  is  an 


1 
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"omnibus"  test).  He  also  found  that  this  test  was  comparable,  but  again 
slightly  inferior,  to  the  W-test.  In  a later  study,  Theune  [35]  found 
the  Wilk-Shapiro  test  to  be  "far  superior"  to  D'Agostino's  test  in  de- 
tecting non-normality.  Theune  did,  however,  find  the  D'Agostino  test 
to  be  comparable  to  the  Shapiro-Francia  test  over  most  of  the  alternative 
distributions  studied. 

3.2  A Large-Sample  Test  for  Normality  Related  to  the  W-Test 

In  this  section  we  describe  a statistic  ft(n)  which  is  an  analog 
of  the  Shapiro-Wilk  W based  on  an  increasing  subset  of  the  order  statis- 
tics. By  appropriately  selecting  our  vector  of  order  statistics  as  in 
the  previous  chapter,  we  are  able  to  invoke  the  asymptotic  distribution 
theory  of  section  2.2.4  and  thereby  determine  the  asymptotic  behavior  of 
2f(n).  Based  on  the  asymptotic  distribution  of  n)  under  the  null 

hypothesis,  we  will  define  the  a-level  critical  region  for  our  test.  In 
view  of  the  asymptotic  sufficiency  result  of  Weiss  [40],  we  would  hope 
that,  at  least  for  large  n,  the  powers  of  an  a-level  test  based  on  fr(n) 
and  an  a-level  test  based  on  W (assuming  the  necessary  coefficients 
were  available)  would  be  "close"  in  some  sense.  In  that  event,  the 

f\j 

large  sample  behavior  of  the  w-test  would  help  to  explain  the  behavior 
of  the  W-test,  and  at  the  same  time,  the  empirically  determined  superi- 
ority of  the  W-test  for  small  samples  would  provide  some  justification 
for  using  the  W-test  when  n is  large.  The  relationship  between  the 
two  tests  remains  open  question.  Some  further  remarks  concerning  thi3 
relationship  appear  in  subsection  3.2.4. 

Before  defining  our  test  statistic,  we  recall  some  notation  and  results 
from  chapter  2. 


B 


3.2.1  Definitions 


b« 


Let  the  sequences  of  integers  (npn),  (Kn),  and  {L^}  be  as  defined 
in  section  2.2.1  with  the  assumptions  (2)  through  (8)  replaced  by 


Lim  p = 0 Lim  K = °°  Lim  np  = 
n^  *n  n-*05  n n-+<»  n 

L 


Lim 

n-?<» 


npnlog(l/pn) 


(3.1) 

(3.2) 


r • n 

Lim  — 0 /0 — ~ — 

n-*»  t3/2  2.  f.  , s 
L p log (1/p  ) 
n n n 


= 0 


3 
n P. 

Lim  „ 
n-x»  ^4 

n 


n 


(3.3) 


(3.4) 


We  note  that  (3.2)  and  (3.3)  above  are  exactly  (2.4)  - (2.6)  where  we  have 

orme 
3/4 


replaced  Tn  by  its  value  for  large  n (up  to  a constant)  in  the  normal 


case  as  given  by  (C.10).  The  assumption  (3.4)  is  new.  For  L^  >.  n 

it  clearly  imposes  no  added  restrictions  on  our  choice  of  parameters.  For 
3 /4 

Ln  = o(n  ),  it  requires  in  effect  that  our  selected  subset  of  order 
statistics  move  into  the  tails  fairly  rapidly  as  n gets  large. 

Throughout  this  chapter  we  will  denote 


..  np  +(i-l)L 

— ”) 


n 


by  T^(n)  and  <f>(T^(n))  by  t^(n),  for  i = 1,2, . . . ,Kn+l.  Also,  we  will 
denote  $ ^(x)  by  T(x)  and  <KT(x))  by  t(x).  We  will  denote  the 

(K  +l)-vector  of  selected  order  statistics  by  ?(n),  with  components  given  by 


■aaaraetHT'-' 
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<\,  'V 

Y.  = Y.(n> 


r 


- 


qnYnpY. 


nV(i“1)Ln 


^n^nd-p  ) 


i=l 


i=2j3, . . . ,K 


i=K  +1 
n 


where 


_ , n Vn)rVn>T1<n> 
2LnV^  p„ 


+ T.  (n)  - 1 + — — — , — i ~-\— 
1 nt. (n)  3 n 


L T (n)  . L 1+T,(n)  , /9 

n l 1/  xi\2,  1 v-j-vX/4 


+ ¥zr-r(- 


t^(n) 


-)]}J 


(3.5) 


Let  $(n)  = (5^,...,$^  +^)  be  the  analogous  order  statistics  selected 

n 

from  a sample  whose  parent  population  is  standard  normal.  Then,  in  view 

of  the  asymptotic  distribution  theory  of  section  2.1.4,  we  can  assume  for 

all  asymptotic  probability  calculations  that  the  vector  &(n)  has  a 

(K  +l)-variate  normal  distribution  with  mean  vector 
n 

'f'(n)  = (T^Cn ),..., +1(n))'  = (q^Cn^T^n), . . . ,TK  (n)  ,0^  +1(n) ) ' 


and  covariance  matrix  V(n)  whose  components  v^^(n)  are  given  by 


r'Vn<1-pn)/tl(n) 


vX(n)tK  +1(n) 

n 


i=j=l 


i=l, j=K  +1 
’ J n 


— H ( 

n(L  -1)  X 
n 


Vn(1-V(j-1)T)/tl(n)tj(n) 

L L 

(p„  + d-l)-^-)(l-pr,-(  j-l)-^-)/t . (n)tj  (n) 


n rn 


n i 


qn(Pn+(i-1)-j)Pn/t1(n)tK  +1<n> 


Vna-pn)/tK  +l(n) 
n 


l=i<j<Kn 


(3.6) 


= ='=•  n 


2<i<j=K  +1 
= J n 


i=j=K  +1 
J n 
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The  inverse  of  the  above  covariance  matrix  has  the  simpler  form: 


n (L  -1) 

V~1(n)= 


n 


1+ 


nV2  tlt2 


2 


% 


^n 


-tlt2 


^n 


2t 


*2*3 


— *fc  *t 

x2  3 


2t! 


-t  t 2t 
-I  K lK 
n n n 


_tK  tK  +1 
n n 


L 

1 + -2. 


_tK  \ +1  * ‘ np  . 
n n rn,_2 

— ^ — 1 — V+l 


qn  n 


. (3.7) 


'V 


3.2.2  The  W-Test 


We  are  concerned  with  the  testing  problem  of  section  2.0,  with 


G(x)  = <Kx),  the  standard  normal  c.d.f.  Analogous  to  the  Wilk-Shapiro 


statistic,  we  define  our  test  statistic  W(n)  as 


to.)  = _ ja^vi^isa 


(3.8) 


C?' (n)V"1(n)V“1(n)T(n)]D2(n) 


where 


K +1 
n 


K +1 
n 


i=l 


D2(n)  = l [Y. (n)  - Y.(n)]‘  with  Y.(n)  = (K +1)  1 £ Y.(n) 

1 i=l 


'V 


Under  H , the  numerator  of  W(n)  may  be  thought  of  as  the  least  squares 
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estimator  of  the  variance  a based  on  Y(n).  The  denominator , as  we  shall 
prove  later,  has  the  same  asymptotic  distribution  (after  suitable 
standardization)  as  the  denominator  of  W. 

Lemma  3.1.  For  all  n , 

(i)  W(n)  i£  location  and  scale  invariant 
(ii)  0 4 W(n)  < 1. 

Proof.  Identical  to  the  proofs  of  Lemmas  1 and  2 of  Shapiro  and  Wilk  [27]. 
Lemma  3.2.  As  n °°, 

np  1/2 

^ = (j^)  (l+o(l))  (3.9) 

1 n 


Proof.  The  proof  is  based  on  the  following  formula  of  Feller  [13, p. 193]: 


113  15  l~$(x)  1 1_  _3 

x " 3 5"  7 <j)(x)  x ~ 3 5 

XXX  XX 


(3.10) 


for  all  x > 0.  Taking  x = $ ^(l-p^)  = -T^(n)  in  (3.10),  we  have 
<J>(x)  = t^(n)  and  (3.10)  becomes 

Tl(n)  + T^(n)  T^(n)  + T^(n)  tl(n)  Tl(n)  + T^(n)  T^n)’  (3-10,) 

2 

Multiplying  both  sides  of  (3. 10')  by  t^iOT  (n)/pn  and  rearranging  terms 
yields 


t (n)  3t  (n)  15t  (n)  t (n)T  (n)  _ t (n)  3tn(n) 

+ r < + T:(n)  < 7-v ^ .(3.11) 

1 p T (n)  m3; 

n 1 


PnTl(n)  p T?(n)  pTj(n)  Pn 

n 1 n 1 


W*> 
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Multiplying  both  sides  of(3.10')by  t^(n)/pn  and  rearranging  terms  yields 


t^n)  St^Cn)  lSt^Cn)  t^n)  t^n)  3t^(n) 

P„T1<">  ‘ PnTj(n)  + pnT’<„>  " PnVn)  + ' ' PjJtn)  ’ P^Cn)  ' 


(3.12) 


Substituting  (3 .12  ) into  (3.11)  we  obtain 


2t  (n)  12t.(n)  15t  (n)  t (n)T  (n)  0 

-2  - — i- + ± + 1 + Tf(n)  - 1 

pnTl(n)  pnTl(n>  pnTI(n) 


< -2  - 


2t1(n)  3t1(n) 

pnTl(n)  _ ^)' 


(3.13) 


Thus,  using  (C.7)  and  (C.8), 


t (n)T  (n)  _ 

— p- + Ti(n)  - 1 = -2U  + 0(go'iTi/^7» 


as  n «.  Finally,  combining  the  above  relation  with  the  definition 
2 

of  q^  and  using  (3.2)  to  e .iinate  smaller  order  terms,  we  obtain 


% = L (1  + °(21ogtl/p  ))}  ‘ 


This  completes  the  proof  of  lemma  3.2. 


Since  we  are  concerned  here  on3.y  with  asymptotic  results,  one  might 

ask  why,  in  view  of  lemma  3.2,  the  definition  (3.1)  was  given  for 

2 

instead  of  the  much  simpler  and  more  easily  computable  = np^/L^?  The 
answer  is  that  even  for  large  n,  the  smaller  order  terms  in  (3.5)  play  a 


— 
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crucial  role  in  the  asymptotic  distribution  of  W(n).  The  e ;act  difficul- 
ties which  arise  if  those  terms  are  neglected  will  be  pointed  out  later. 

In  many  of  the  arguments  below,  however,  it  will  be  true  that  those  terms 
may  be  neglected  for  asymptotic  purposes,  in  which  case  the  result  of 
lemma  3.2  will  be  useful. 

f\j 

Let  p(n)  denote  the  quantity 

CTMn^'^n^n)]2 
[T*  (n)V-i(n)V-A(n)f(n)]f ' (n)‘f(n)  * 


The  next  subsection  of  this  chapter  is  devoted  to  proving  the  following 
theorem : 


Theorem  3.1.  Under  the  null  hypothesis 


5/23/2 


/ 27x1/2 
v 2} 


(fr(n)  - 


W(n)) 


(3.14) 


is  asymptotically  normally  distributed  with  mean  0 and  variance  1. 


Thus,  we  define  the  a-level  W-test  for  normality  as  follows : 


Reject  iff  W(n)  < p(n)  + c,  a(n) 
0 <J>,a 


where 


a(n) 


/_2\l/2  Ln 
k27>  5/2  3/2 

n Pn 


and  c^  ^ is  the  a-percentage  point  of  the  standard  normal  c.d.f.  The 
reason  for  choosing  the  lower  tail  of  the  distribution  as  our  qritical 
region  is  that  under  the  alternative  hypothesis,  W(n)  tends  to  be  smaller 


than  under  the  null  hypothesis  (this  statement  is  made  rigorous  in  section 


r» j 

3.3).  Figure  3.1  depicts  the  large  sample  behavior  of  W(n)  and  the 
critical  region  $n(a)  of  "*-^e  in  leraroa  3.5  below,  we  show  that 

1 - y(n)  is 

lWcl/p  ) 

°<  %3  ->  • 

n p 
rn 

Hence,  (3.4)  implies  that  the  quantity  c(n)  converges  to  zero  faster 
than  the  quantity  u(n)  converges  to  1,  and  the  100(l-ct)%  confidence 
region  for  V? (n ) approaches  the  curve  p(n)  faster  than  the  curve 


approaches  its  limiting  value. 
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3.2.3  Proof  of  Theorem  3.1 

Since  we  are  assuming  that  the  null  hypothesis  holds,  we  may  also 

assume  without  loss  of  generality  by  lemma  3.1  that  Y^,...,Yn  are  the 

order  statistics  of  a sample  from  a N(0,1)  population.  Hence,  for  all 

our  asymptotic  probability  calculations,  we  will  hereafter  take  the 

distribution  of  ?(n)  to  be  (K  +l)-variate  normal  with  mean  vector  T(n) 

n 

and  covariance  matrix  V(n)  as  defined  in  the  previous  section.  The  . 

f\ , j 

theorem  will  be  proved  by  examining  the  Taylor  series  expansion  of  W(n)  '•} 

•: 

about  the  point  T(n)  and  determining  which  terms  are  asymptotically  ■ 

dominant.  Before  doing  so,  we  introduce  two  lemmas  which  are  required  - 

for  the  proof. 

( 

i 

Lemma  3.3.  As  n ■* 


'b 

var  Y. 

i 


■ 


and 


K +1 
n 

l v..(n) 
j=l  13 


= 


Lniogd/pn) 


(l+o(l)) 


npnlog(l/pn) 


0(1) 


np  1/2 

(— ) 0(1) 

L3 

n 


(l+o(l)) 


L -1 
n 


i=l,Kn+l 


i=2 ,3,»..,K 


i*l,K  +1 


(3.15) 


(3.16) 


i=2 ,3 , . . . ,K 


'V 

Proof.  It  is  easily  verified  that  max  (var  Y.)  occurs  at  i=2  or  i=K^. 

2<i<K  1 

= = n 


An  application  of  (C.10)  yields  (3.15). 
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In  proving  (3.16),  we  first  consider  the  case  i=l  (or  i=Kn+l  by 


symmetry).  Using  the  definition  (3.6)  and  simplifying,  we  have 


K +1 

n£  v..(n) 

3=1  13 


3/2 


■ n 


■ n 


(L  -l)t2(n)  Ly2n1/,2t1  (n)  j=2 

n 1 n 1 


By(C.lO),  the  first  term  is  0([Lnlog(l/p^)3  ^)  as  n ->  «,  and  the  second 


\ Cl-Pn-(3-D^ 


t j (n) 


term  is  approximately 


1-P. 


( Pn  )1/2  n / (I-*)  d;; 

2Lnn  LnVn)  Pn  t(x) 


which  equals 


Tx(n) 


nP^  1/2 

(—?)  (l-2p  )-~y 

3 n t (n) 


Applying  (C.8),  the  first  half  of  (3.16)  follows.  (In  Chapter  3 as  in 
Chapter  2,  we  will  frequently  have  occasion  to  approximate  summation  by 
integration  as  above.  We  again  refer  the  reader  to  Appendix  C,  where  we 
formally  carry  out  this  procedure  and  obtain  upper  bounds  for  the  resulting 
errors  for  integrals  of  the  type  appearing  in  this  chapter.) 

For  i=2,3,...,K  we  have 
’ ’ ’ n 


K +1 
n 


pn<1-V(j-1H?)  pn(V(i-1H;>  Ln 


r , s r n *n  n *n  *n  n n 

.A  Vij  n " t1(n)tiTn)  “ tK  (n)ti(n)  n([T-l) 

J-  Nn 


L L 

L i-1  [p  +(j-l)-^][l-p  -(i-1)-^] 

n r r n n n n 

n(Ln-l)  nt.(n)tj(n) 


. < 
J 
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n :;i 


A nt.(n)t.(n) 

n=x  x 3 


- nr  l + _A_  (if  ♦ (x)[i-*(y)_3  dx 

0 n(L  -l)t. (n)t .(n)  L-l  J,  >.  4>(y) 


nil 


n T1<n) 


TK  +l(n) 


+ ”/  [i-«uw<y)dil  (i+od))} 

J (til  V I 


where  v n,  T.(n).  The  first  term  above  is  o(Cnp  L ] ) as  n ■+  ».  In 

J = i n n 

_i/o  _■}  i/2  -1 

particular,  since  (np  L ) = L (L  /np  ) , it  is  o(L  ).  Noting  that 

r 7 *n  n n n n n 


/ <5>(x)dx  = / [l-<I>(x)]dx, 
-a  y 

the  second  term  above  can  be  written  as 


C(L  -lH(y)j  { / *(x)dx  - y*(y)>. 

" Tl(n) 


! 


rM  1 

h m i 


L 1 ! 

L- 1 i 


Evaluating  the  expression  in  braces,  we  obtain  (p^T^Cn)  + <j>(y)  - t^(n)}. 

An  application  of  (C.8)  completes  the  proof  of  (3.16).  This  completes  the 
proof  of  lemma  3.3. 

It  is  well  known  that  for  the  full  set  (i.e.  all  n)  of  standard 
normal  order  statistics,  the  row  sums  of  the  covariance  matrix  ax^e  equal  to 
1.  Equation  (3.16)  shows  that  the  ratio  of  the  common  row  sum  for  our 
selecxed  subset  of  order  statistics  to  that  of  the  complete  set  is 
asymptotically  equal  to  the  fraction  of  order  statistics  that  we  choose. 


r ‘ ? 
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Lemma  3.4.  The  vector  ^(n)  is  an  asymptotic  eigenvector  of  the  matrix 


-1 


V(n)  corresponding  to  the  eigenvalue  [2(Ln~l)] 


% -1 

Proof.  We  will  denote  the  (K^+D-vector  T'(n)V  (n)  by  y(n ),  with 


individual  components  Y^(n).  It  follows  directly  from  the  definitions  that 


r 


n (L  -1) 


Yi(n)  = —T 


t (n)  L 

{(If— -t .(n)T  (n)  - t_(n)T  (n)} 

Si  npn  1 1 22 


i=l 


ti(n){2ti(n)Ti(n)-ti_1(n)Ti_1(n)-ti+1(n)Ti+1(n)} 


i=2 ,3 , . . . ,K 


tK  +l(n)  L 

— ~ {(1+rr -)t„  (n)Tw  ^(n)-t^  (n)T„  (n)}  i=K  +1 


np  ' K +1  K +lv“'  K K ' 
rn  n n n n 


Using  (C.l),  Taylor  series  expansions  of  t^_1(n)T^_1(n)  and 
yield 


L 


L_  2 T.(n) 


t.  . (n)T.  (n)  = t . (n)T.  (n)  - ...  - T?(n))  -(-£•)  ■ ■ 7— r 

l-l  l-l  11  n 1 n t.UJ 


L 3 1+T2(n)  l,Ln  4Ti(n)  3 . . 

+ k-)  (— ) - 6(~}  (t7Go)  + 6i(l’n) 

3 n t2(n) 


(3.17) 


L 9 L 2 T (n) 

t.  ,.(n)T.  . (n)  = t.(n)T.(n)  + — (1  - T.(n))  - (— ) 


i+l'  ' l+l' 


1 1 


n 


n t.(n) 


. L 3 1+T7(n)  L 4 T.(n)  3 

- (-27->  - <th57>  + ¥*•"> 

tAn;  1 


4 4 3 

where  6n(i,n)  and  69(i,n)  are  o(L  /n  p ) as  n ->■  00 . Hence, 


'tl(n)  Ln  K 0 Ln  2 Ti(n> 

(tt11  t (n)T  (n)  + AT  (n)-l)  + (-^-) 

qn  pn  11  n 1 n t^n) 


! Ln  3 1+T  (n)  L 4 T (n)  3 

+ *->  - Vl-"» 


L 2 T.(n)  . L 4 T.(n)  3 

r n,  x . 1/  ns  , i \ 


ti(n){2(“)  TO)  + > <t^T)  + 63(i’n»  i=2  > • • • > Kn 


Y.(n) 


n <VX)J 


tK  +l(n)  L 
n , n 


L 2 TK  +l(n)  L 3 1+TK  +l(n) 

, / fr\  ( n L ii\  / n N 

( n’  (t  (n)  " 3("n)  (~2  ~7_) 

V1  lK  +l(n) 

n 


i=K  +1 
n 


. 1,L  4 TK  +l(n>  3 

„ 6V  <tf-rsr>  - VVi-")> 

n 


(3.18) 


where  o3(i,n)  = -(6^(1, n)  + 62(i,n)).  We  note  that  6g( (Kn+2 )/2 ,n)  = 0 
for  all  n.  Let  6^(n)  denote  the  quantity 


Vn),  Ln  Ln  2 Ln  2 Vn)  i 1 3 1+T?(n) 

+ <-?)  OiiT  1 §4>  (^7-» 

11  11  a t (n ) 


tx(n) 


By  the  definition  of  qn>  64(n)  = 0.  Also,  using  (3.9)  and  (C. 7) 


Y'(n)  = T,(n)V‘1(n)  = 2(L  -l)(l+0[— ]2))T'  (n), 

n np 

rn 


Post-multiplying  both  sides  of  the  above  equation  by  V(n),  rearranging 
terms  and  applying  (3.2),  the  proof  of  lemma  3.4  is  complete.  QEI 


Remarks . The  above  result  is  not  surprising  in  view  of  the  work  of  Stephens 
[31].  Working  with  the  complete  set  of  normal  order  statistics,  Stephens 
found  that  the  mean  vector  is  again  an  asymptotic  eigenvector  of  the 
covariance  matrix,  but  this  time  corresponding  to  the  eigenvalue  y = 1/2. 
More  generally,  by  solving  a certain  differential  equation,  Stephens  found 
that  the  sequence  { X ^ = (j+1)  ■*"}  is  a sequence  of  asymptotic  eigenvalues 
for  the  covariance  matrix  of  (Y^,...,Yn>,  with  corresponding  eigenvectors 
{Wj}  whose  components  are  the  Hermite  polynomials  He^(x)  evaluated  at 
the  appropriate  population  quantiles.  Stephens'  method  does  not  yield 
error  terms,  however.  In  our  case,  since  the  eigenvalues  are  approaching 
zero  as  n increases,  the  asymptotic  result  is  only  meaningful  provided 
the  error  terms  approach  zero  at  a faster  rate.  Hence  the  above  method 
of  proof. 

An  Example.  Before  continuing  with  the  proof  of  the  theorem,  we  give  a 

simple  application  of  lemmas  3.3  and  3.4,  using  them  to  show  that  the 

, 2 
symmetric  sum  of  squares  estimator  of  the  variance  a and  its  analog 

*\i 

based  on  Y(n),  suitably  standardized,  have  the  same  asymptotic  distribu- 

2 2 

tion.  Without  loss  of  generality,  we  take  a = 1 as  above.  Let  S (n) 
denote 


It  is  well  know  that  S (n)  is  approximately  N(l,2/n)  for  large  n,  and 


I 


thus,  (n/2)  ' (S  (n)  - 1)  converges  in  law  to  the  standard  normal  distri- 
bution. Now  consider 


K +1 
n 


S2(n)  = (K  +1)"1  l (Y.  - $.) 

n u i 


i=l 


^2  ^ 
Expanding  S (n)  in  a Taylor  series  about  T(n), 


K +1 


^2 


S^(n)  = (K  +1)"1  l T?(n)  + 2(K  +1)_‘L  l T.(n)(Y.  - T.(n)) 
n . , x n . , x xx 

x=l  x=l 


K +1 


n 


-1  r ^ , v/CV  & 


+ (K  +1)"1  l l (6..-(K  +l)~1)(Y.-T.(n))(^.-^.(n)) 
i=l  j=i  -‘-J  n 11  D ] 


K +1  K +1 
n n 


-lw& 


where  6..  = 1 if  i = j and  = 0 otherwise, 
x] 


Using  (C.9),  the  first  term  in  the  above  expression  is  easily  shown  to 
converge  to  1 as  n tends  to  infinity: 


K +1 
n 


K 


(Kn+l)_1  l $2(n)  = (Kn+l)'1[2qjT2(n)  + J T2(n)3 


1-P. 


n 


4p  log(l/p  )(l+o(l))  + / T (x)dx(l+o(l) ) 

rn  n 1 

^n 


4d  log(l/p  )(l+o(l))  + (l-2p  -4p  log(l/p  ))(l+o(l)) 
• n n n n n 


= 1 - o(pnlog(l/pn)). 


The  second  term  is  asymptotically  normal  with  mean  0 and  variance 


4Kfi  T*1  (n)V(n)^(n),  which,  by  lemma  3.4,  can  be  written  for  asymptotic 


-1 


purposes  as  2n  . And  finally,  since  the  third  term  has  expectation 
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o((npn>  ■*")  (by  lemma  3.3)  and  standard  deviation  also  o((npfi)  (by  an 

argument  analogous  to  the  one  used  in  the  proof  of  lemma  3.7  below),  it 

is  o ((np  ) "*■)  as  n ->■  «.  Thus,  (n/2)^'//^(^2(n)  - E(S2(n)})  converges 
P ^ 

2 

in  law  to  the  standard  normal  distribution,  where  E{S  (n)}  converges 
to  1 as  n -v  ®. 

Proceed ;ng  now  with  the  Taylor  series  expansion  of  W(n),  we  have 


[T'(n)V_1(n)T(n)32 


[T*(n)V-1(n)v"1(n)T(n)][T'(n)T(n)]  i=l 


+ l wJ1}(?(n))(Y.-T.(n)) 


(3.19) 


K +1  K +1 
n n 


l l wJ2)(T(n))(Y.-T<.(n))(Y.-?.(n))  + ... 
i=l  j=l  3 11  33 


where  (w^1)(Tf(n))}  denotes  the  set  of  first  partial  derivatives  evaluated 

t\,  2 ) ^ 

at  T(n),  {w„  (T(n))}  the  set  of  second  partials,  ere.  To  simplify 
notation,  we  will  hereafter  denote  the  first  (constant)  term  in  the  expan- 
sion (3.19)  by  S^(n),  the  second  (linear)  term  by  S^n),  the  third  (quad- 

f\j 

ratic)  term  by  S._(n),  and  the  remaining  terms  (W(n)  - S,(n)  - S_(n)  - S„( 
•3  JL  Z o 

by  S,}(n).  In  lemmas  3.5  to  3.7,  we  deal  with  S^(n),  S2(n)  and 
(S3(n)+S^(n))  respectively. 

Lemma  3.5.  As  n -*■  ®, 


S1(n)  = 1 — log(l/pn)  (l+o(D), 

27n  ^n 


(3.20) 


Proof.  Contained  in  the  proof  of  lemma  3.4  is  the  fact  that 


Y'(n)  = 2(L  -l)T'(n)  + e’(n) 
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where  e'(n)  = (e^n), . . . ,eK  +1(n)),  with 

n 


(L  -1)6  (n)(l  + o(l)) 
n 5 


i=l 


ei(n)  = 


, L2(L  -1)  T?(n) 
Inn  l 

3 2 2,  N 

n t^(n) 


(l+o(l))  i=2,3, . . . ,K 


- (L  -l)S_(n)(l+o(l)) 
n 5 


i=K  +1 
n 


and  5,-(n)  = -(Ln/npn)^2(21og(l/pn) J"^2.  Thus,  we  have 


S.  (n)  = 


[2(L  -l)T'(n)?(n)  + e'(n)T(n)]2 
n 


1 [4(L  -l)2!' (n)T(n)  + 4(L  -l)^' (n)e(n)  + e' (n)e(n)T' (n)^(n) 


and,  after  some  simplification. 


1 - S.(n)  = — — -e-n )— ' --n ~ [e,(n)T(n)]2  (1+o(1)) 

1 4n2 

since  4(Ln-l)2[T' (n)T(n)]2  = 4n2(l+o(l))  asymptotically  dominates  the 
denominator  of  S^(n).  Now, 


l\l  -l)2 

e'  (n)e(n)  = 2(Ln-l)  <$5(n)  + jj 

9n 


Kn  T®(n) 

l -T— 

i=2  tT(n) 

l 


4L^log(l/p  ) f n 6,  , 

’ 5 5 ~*H  'p  XT41  (1+°<1)>' 

n p n *n  t (:c) 

n 


Using  the  method  used  in  deriving  (C.5)  to  evaluate  the  aboye  integral 


we  obtain 


, ” T6(x)  , 1 l°S<l/Pn> 

> 77TdX  3 3 — 

p t (X)  » 


5 , -3. 

73  + 0(pn  >’ 

9pn 


so  that 


4L  log (1/p  ) 

e'(n)e(n)  = 0-77  n-  - (l+o(l)), 

27n  p 
*n 


(3.22 


Similarly, 


L2 

e'  (n)T(n)  = ^ — -log(l/p  ) (l+o(l)). 

O Up  II 

*n 


(3.23 


Hence,  since  Lim  xlog  x = 0, 
X-+0 


1 - S1(n)  = q-g-log(l/pn)  (l+o(l)). 

J 27n  p li 

rn 


This  completes  the  proof  of  lemma  3.5. 


Lemma  3.6.  The  quantity  S„(n)  is  asymptotically  normally  distributed 


4 5 3 

with  mean  0 and  variance  (2L  /(27n  p ))(l+o(l)). 
n n 

Proof.  Since  (^  - 'Kn))  is  asymptotically  normal  with  mean  vector  0, 
we  need  only  determine  the  variance  to  complete  the  proof.  Using 
y = (y,,...,y„  .)  as  a dummy  variable  and  letting  D(y)  denote 

Jl  tJ. 


I (y£  - y- > > 

i=l 


we  have 
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= 1 1 [y,r(n)Y,(n)y_3, 

i y Y* (n)y(n)  3y.  D(y)  J^y 


Using  the  facts  that  -r — (y'Y(n)Y1 (n)y)  = 2Y.(n)y'(n)y  and 

dy . l 

3 - 1 

— — D(y)  = 2(y.-y),  the  above  expression  becomes 
3yi  i 


"i(1)<y>  = 


2 Yi(r)YI(n)y  (y'Y(n)Y' (lOyX^  - y.) 

r[ 


Y1 (n)y(n)  d(Y) 


D2(y) 


■] 


(3.24) 


In  particular. 


“i1)(,,<n)1  = ?lnff(-)i:rn?n'':ICYi<-n)  - I'(D)I<n)  Vn)]’ 

Y mwn;  T'  (n)T(n)  T'(n)T(n) 


and  using  lemma  3.4,  we  can  ’write  the  above  expression  for  asymptotic 
purposes  as 


w^Cftn))  = i(Yi(n)  - 


-I-—!-'  Y (n)  ).(l+o(l)). 
T' (n)T(n) 


(3.25) 


Since  y'  (n)T(n)/T'  (n)T(n)  'v  2(L  -1)  and  y (n)  ^ 2(L  -l)T.(n)  as  n °°, 

n i n l 

we  must  be  careful  about  the  smaller  order  terms  in  applying  lemma  3.4  to 
the  quantity  in  parentheses  in  (3.25).  Recalling  expressions  (3.21)  and 
(3.23),  we  can  write 


w^1^(T(n)) 


c ( n ) - 6 (n)T  (n)  . 

no  6 1 1=1 

. L2(L -1)  T?(n) 

< 1 6,(n)T.(n)  i=2,3,...,K  (3.26) 

3 n3  t?(n)  6 1 n 

“ -£<5  (n)  - S.(n)T  Xn)  i=K  +1 

n 5 6 K +1  n 

L n 
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1/2 

where  6g(n)  = (4L3/3n3pn)log(l/pn) . Noting  that  ^(n)  = -qn(21og(l/pn)) 
as  n ■>  » and  using  (3.2),  (L/n)65(n)  = o(66(n)?1(n)),  so  that  the  terms 
involving  6 (n)  in  (3.26)  can  be  neglected  for  asymptotic  purposes,  and 
the  variance  of  $2(n)  is  given  by  the  sum  of  the  following  twelve 
expressions: 


4 , , v.2,  , Ln  ^n*'1  Pn^ 

4qn(log  1/Pn)6e<n)]Sn^IT  t2(n)" 

n L V 

-hc^dog  i/pn>Vnter^l2  ° 


n t“(n) 

_ 3 K _3 . . 

L p n T . (n) 


..  L3p  Hi  T|(n) 

vl/2*  J _J: 


(p  + (i-lM 

* r\  n 


'^<21°e  1/P"r'‘66<n)^)  A t’o,)''"  ’ - ’ »■ 
f^mog  i/Pn)1/24<")n(L  -iyowr .( tm(1-v(i-4> 

n 1 3-*  3 


-|^(2iog  VP  1 


. L5(L  -1)  Kn  T?(n) 
Inn  vi 


9 n7  i=2  t6(n) 
i 


L3  KnTl(n) 


l -i— (Pn-U-D^Xi-P^-'i-l^) 


166(">-|  l -i— (Pn^i-U^U-Pn-U-D-f) 
n x=2  t.(n) 


Kn  T2(n) 


(3.27) 

(3.28) 

(3.29) 

(3.30) 

(3.31) 

(3.32) 

(3.33) 

(3.34) 

(3.35) 


r 5 . i — 1 K m3  > ,_3/  . 
0 L (L  -1)  n n T.(n)T.(n) 
2 n n r v l 3 


l 1 4 1— <P +(i-l)-5)U-p -(j-l)-i) 


(3.36) 


q 7 L L q 

n i=2  j=i+l  t^(n)tj(n) 


n n 


r 3 Kn_1  Kn  T.(n)T?(n)  T?(n)T.(n)  L L 

ils  (n)  I I C— i—  + 4 3 ]<p  Hi-D-iHl-p  (3.37) 

4°6  n i-2  j=i+l  t.(n)t.(n)  t.(n)t.(n)  n n n n 

3n  i 3 i 3 

L _ V1  Kn  T (n)T.(n)  L L 

J,  . E (3-38) 

n i=2  3=i+l  i 3 


We  will  now  show  that  expression  (3.36)  asymptotically  dominates  the 
other  terms  in  the  above  variance.  For  large  n,  expression  (3.36)  is 
approximately 


2 X1’  1>1>  „ .T3(x)T3(v),  , 

q s — J J x(i-y)-r — o^ydx. 


9 5 i i 3,  ,3,  , 

n p x t (x)t  (y) 


Evaluating  the  inner  integral  using  (C.5)  this  becomes 


h\h  -1)  1_Pn 

Inn  C 

9 5 J 

n p_ 


i + j.  (m. 

p t (x)  Pn  p t (x)  p t (x)  _1 


which,  again  applying  (C.5),  is  easily  shown  to  equal 


2 

27— (1+0(P  n)): 


the  first  integral  is  zero  by  a symmetry  argument;  the  second  integral 
-1  2 

contributes  pn  (T^nJ/t^n) ) plus  smaller  order  terms,  and  the  third 

3 

integral  contributes  (l/3)(T1(n)/t1(n))  plus  smaller  order  terms. 
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Applying  (C.7),  we  obtain  the  stated  result. 

Next  we  show  that  the  remaining  eleven  expressions  in  the  variance  of 
4 5 3 

S^(n)  are  o(Ln/n  p ) as  n ->  ». 

Applying  the  definitions,  lemma  3.2,  and  (C.10),  we  see  that 

32  Ln  (l0g  1/pn)2 

[expression  (3.27)]  = ~ -5- (l+o(l)) 

n pn 

,r4.  5 3n 
= o(L  /n  p ) 
n n 

as  n ->  08 . 

Expression  (3.28)  equeals  pn  (expression  (3.27) )j  so  it  is  obviously 
of  smaller  order. 

Adding  expressions  (3.29)  and  (3.30),  applying  definitions,  approxi- 
mating summation  by  integration  and  neglecting  smaller  order  terms,  we 
obtain  the  quantity 


8 

9 


1_.Pn  T3(x) 

/ (l-2x)-y^dx. 

P_  t (x) 


Using  (C.5)  and  (C.7)  to  evaluate  the  above  integral,  the  expression  becomes 


- ~ -~t( l+o(l)) 


6 

n P. 


which  is  obviously  of  smaller  order  than  (3.36)  as  n -»■  ». 

Proceeding  in  exactly  the  same  manner,  we  add  (3.31)  and  (3.32)  and 

4 2 5 

obtain  an  expression  of  order  L^dog  l/?n)  /n  , vrhich  is  again  of  smaller 
order  than  (3.36) 

Approximating  summation  by  integration  and  neglecting  smaller  order 
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terms,  expression  (3.33)  becomes 


5 1-p 


B J W£i 

n P. 


n 


t6(x) 


dx. 


Integrating  by  parts  twice  and  applying  (C.5),  the  above  expression  is 
easily  shown  to  equal 


< l+0(pn». 


n p 

c r 


and  this  is  again  of  smaller  order  than  (3.36). 


Since  (expression  (3.34))  = O(pnlog(l/pn)expression  (3.33)),  it  follows 


that  (3.3m)  is  of  smaller  order  than  (3.36). 

Asymptotically,  expression  (3.35)  is  approximately 


-/^(log  l/pn)*  / x(l-x)^x, 


T (x). 


n P. 


t‘(x) 


6 2 7 2 

which  is  CKL^dog  l/pn)  /n  p^)  as  n -*  °°.  Thus,  expression  (3.35)  is 


o(expression  (3.36)), 


Finally,  since  (L  /n  p )<5_(n)  is  o(n  6 (n))  as  n ^ showing 
n n 6 o 


that  expression  (3.38)  is  of  smaller  order  than  (3.36)  will  imply  the 
same  for  (3.37).  But,  (3.38)  is  approximately 


1-p  1-p 

n rn 


16  n , , , , ,2  r“  r “ ,,  NT(x)T(y),  , 

;(iog  i/pn)  j I 

n Pn  x 


9 5 2' 

n Pr 


Evaluating  the  inner  integral  this  becomes 


l_T2(log  1/p„)2 


*•  y\ 

Cp  T*(n)+l-p  -T  tl(n)tK  U(n)]  ! !TrfTi>t 
" n p„ 


3-”Pn  3 ^.-p  ^.-p 

• / x(l-x)T + J xT2(x)dx  - J x2  £~|jdx 


It  is  easily  shown  that  the  expression  in  braces  is  0(1).  Hence, 

i£ 

(expression  (3.38))  = °(— 5-3P  0-°g  1/p  ) ) = ©(expression  (3.36))  as 

n p 
*n 

n ->  «.  Summarizing,  we  have  shown  that  the  variance  of  82(0)  is  given 
4 5 3 

by  (2Ln/(27n  pn))(l+o(l)),  which  completes  the  proof  of  lemma  3.6.  QED 

2 5/23 12 

Lemma  3.7.  The  quantity  S^(n)  + S^(n)  is  o^L^/n  / Pn  ) ££  n 

Proof.  We  first  consider  Sg(n).  Differentiating  (3.24)  with  respect  to 
y.  and  simplifying  terms,  we  obtain  the  following  expression  for  the 

*\j 

second  partial  derivatives  of  W(n): 


M2)  ,V 

w. . (y) 

13 


Y’  (n)Y(n) 


Y* (n)y.(n)  2y'y(n)y' (n)y[y .-y.][y .-y. 3 

2D(y)  D3(y) 


Y£  ( n ) [y • -y . 3 +Y . ( n ) [y . -y . 3*y^-(— C 6 . . - ( K +1 ) 13 

_Yi  (n)v— 3 3 ± £ ±3 2 

- 7 S2(?) 


(3.39) 


In  particular. 
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r 


tf?)(»(n))= 

13  Y (n)Y(n) 


Y.(n)Y.(n)  2T.(n)T.(n)[T' (n)Y(n)]2 

_ 3 + i 3 


2T'(n)?(n) 


CTUnJ^n)]3 


Y • (n )¥ . (n  )+y  • (n  )ft.(n  )+Y - ^-^-[6. .- ( K +l ) 1] 
Yr (n)^(n)— 3 3 _L  13 


[T'(n)?(n)]2 

Using  the  result  of  lemma  3.4,  we  can  write  the  above  expression  for 
asymptotic  purposes  as 


wf?^(¥(n))  = 2(L  /n)[(L  /ntf.U)?  (n)  + (K  +1)-1  - 6..].  (3.40) 
13  n n x j n 13 


?! 


Since 


K +1  K +1 
n n 


E{S3(n)} 


(1/2)  l l w^2)(T(n))v  .(n), 
i=l  j=l  J J 


it  is  straightforward  to  show  using  (3.6),  (3.40)  and  the  methods  of  the 
proof  of  lemma  3.6  that  the  asymptotically  dominating  term  in  the  above 
expectation  is 

_ L L 

IT  n (p  +(i-l)-£)(l-p  -(i-1)— ) 

n r,  ti  n n n 

n2(L  -1)  i=2 


t?(n) 

i 


which  becomes 


n ^ / x(l-x)t  2(x)dx  (l+o(l)). 


Evaluating  the  above  integral,  we  find  that 


E{S3Cn)}  = 2n  1log(l/pn)  (l+o(l)) 


(3.41) 


'Al  * ' 


WB 
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as  n ■*  °°. 

Next,  we  obtain  an  upper  bound  on  the  standard  deviation  of  S^Cn). 
To  do  so,  we  use  the  fact  that  for  any  random  variables  x^jX^, . . . ,xn, 

n n 

o(  l X.  ) 4 l o(x.).  (3.42) 

i=l  1 i=l  1 

2 

Note  that  if  X is  normally  distributed  with  mean  0 and  variance  a , 


E{X2m)  = (2m-l)(2m-3)...(l)a2m 


so  that 


VaitC^-^.CnJX^-^Cn))}  < E{^i-Ti(n))2(Yj-Tj(n))2} 


< [E{Y  .-T^n)  )4}E{ (n) )4}]1/2 


= 3var{  Yi)Var{^  } 


-2 


[Lnlog(l/pn)] 


[nLnlog(l/pn)tj(n)]  1 


<.  const/ 


[nLnlog(l/pn)t2(n)]  1 


[nt^(n)t. (n)] 


-2 


i,je{l,Kn+l) 


ie{l,K  +1} 
n 

j«l,Kn+l} 


(3.43) 


i(f{l,K  +1} 
je{l,K  +1} 


i,j«Kl,Kn+l} 


where  we  have  used  lemma  3.3  and  the  definition  (3.6).  Thus,  from  (3.40), 
(3.42),  and  (3.43) 


'■  gqWWCTCWft  B»pi 
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K +1  K +1 
n n 


a{So(n)}  <_  l (ftn))|0{(VVn)>(V^(n>)} 

o . _ . - 11  11  J J 

1=1  1=1  J J J 


= c1[nlog(l/pn)]“1  + c2L^//2n"3,/2log(l/pn)  + c3n  1(log  l/pn)2 


+ c4CnPnl°g(l/Pn)] 


-1 


where  c^,  c2,  c3,  and  are  0(1)  as  n 00 . The  asymptotically 


dominating  term  in  the  above  expression  is  easily  seen  to  be  the  term 


-1, 


associated  with  cu;  that  is,  c{S.(n)}  <.  cuCnp  log(l/p  )]  (l+o(l))  as 


5 5 3 J = 4 rn  w ' *‘n’ 

n ■+  r».  But,  the  latter  expression  can  be  written  as 


r 2 3/2  1/2 

L n p 
n *n 


-1, 


C4  5/2  3/2  2 (1°g  1/Pn)  (1+o(1)) 


n P 


n 


2 5/2  3/2 

Applying  (3.’4),  we  see  that  a{S«(n)}  = o(L/n/p  ).  Similarly,  from 

o n n 


*.>.41),  E{S„(n)}  is  also  o(L2/n5//2p3^2 ) . Thus,  the  term  S»(n)  is 
o n rn  o 


o (o(n))  as  desired. 
P 


We  now  deal  with  the  higher  order  terms  in  the  expansion  (3.19). 
Applying  (3.42),  the  standard  deviation  of  the  kth  order  term  (k=3,4, . . . ) 
is  less  than  or  equal  to 


K +1  K +1  K +1 

l  "l  • • . "l  , <¥(n))|o{?.  <»»...<!  -»  <n)». 


i,  =1  i0=l  i,  =1  12’  k 

12  k 


We  will  find  upper  bounds  on  the  quantities  in  the  above  summation  which 


2 5/2  3/2 

are  sufficient  to  show  that  the  sum  is  o (L  /n  p ) as  n 00 . 

p n rn 


To  simplify  notation  for  the  remainder  of  the  proof  we  define 


A(V)  - CY'(n)y] 

My;  y*  (n)y(n) 


K +1 
n 

B(y)  = I (y.-y.)‘ 
i=l 


A1  (y)  - MM  |* 

i 'y 


Bl<*>  ■ Tpf  ly 


(X)  = OJ.y 1 


B'.'.(y)  = iLMzi  1^, 

aijiy;  8y.3y.  ly 


for  i,j  = 1,2,...,K  +1.  We  note  that  all  higher  order  derivatives  of 
A(y)  and  B(y)  are  zero.  We  have  shown  previously  using  lemma  3.4  that 


A1 (T(n) ) L 

|— 1 4 const  (log  1/p  r' 

A(ftn))  n n 


Bl(T(n))  L ./0 

|— 1 4 const  (log  1/p  r' 

B(T(n) ) n n 


A"  (T(n))  L 

| 1 4 const  (log  1/p  ) 

A(T(n) ) n n 


(3.44) 


B"  (T(n)) 

1 

B(T(n) ) 


const (L  /n) 
n 


if  i=j 


const(L  /n)  if  i/j 

n 


as  n + ®. 


Now  let  R(y)  = log(A(y)/B(y) ) . It  is  easily  seen  that  the  general 

Ck) 

kth  partial  derivative  of  R(y),  which  we  will  denote  by  R.  . . (y), 


involves  only  sums  of  products  of  the  forms 


A",  , (y)  A"  (y) 

. r J1J2  -,r  J3J4  . 
COnSt  C~  "~A(y ) 


A'.'  . (y)  At  , . A*.  , , 

^nn-rV  •)m1+l  y ^nu+m  y 

1 r ± ± -ir = ir - - ~i 

L A(y)  JL  A(y)  JL  A(.y)  J 


B'J  , (y)  B'J  (y) 

, r J1J2  , r 33J4 
COnSt  C'  B(y)  3 C~B(7r":! 


BV  . (y)  Bi  (y)  B'  (y) 

^mg-l^mg  3m3+1  ^mg+m^ 

- B(y)  3 [ B(y ) 3 C B(y) 


where  m1  + m2  = m3  + m^  = k.  But,  by  applying  the  definitions  and 
rearranging  terms,  we  also  have 


wPcfon)) 


= S1(n)R^1)(T(n)) 


w^An)) 


S (n){R^l)(T(n))R(.1)(T(n))  + R?  2)($(n) )} 

1 i D ID 

r ^ 

R^1)(^(n))R^)(^f(n))  + R^1)(^(n))R^)(?(n))  + 


w^Cftn))  = S,(n)J  R,U)(T(r:)R(2)(T(n))  t R^tf(n))  + 


R^1)(T(n))Ri1)(T(n))R[1)(T(n)) 
i D K 


(3.45) 


terms  involving  products  of  derivaties  of 

R(y),  where  the  sum  of  the  orders  of  the 

wfk?  . (T(n))  = S.(n)J  . . ..  . . . . . . . ■ 

l^ig...^  1 < derivatives  m each  product  is  k and  each 

index  (i.e.  i^,i2,...)  is  represented  exactly 

once  in  each  product 


Thus,  since  S^(n)  1 as  n ■*  °°, 
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l”iki  i (^(n))!  4 C-^iogU/p)]5* 
2 ‘ ’ Xk  n n 


(3.46) 


where  s*  = max{(k-s),[~^]} 

and  s = the  no.  of  equalities  among  the  indices  i^,^,  * * * ’ifc* 

Next  we  turn  to  the  standard  deviations  of  the  (Y.  ) terms  in 

X1  xk 

the  expansion  (3.19).  The  argument  used  in  obtaining  (3.43)  also  yields 
the  following  fact  for  k=3,4,5...: 


Var{(Y.  (n))(Y.  (n))...(Y.  -T.(n))}  < 


^l 


x2  x2 


Xk  xk 


(3.47) 


,-1 


Var((^.  -^.(n)). . .(^.  (n))  • const 


X1  X1 


Vi  Vi 


[L  log(l/p  )]  x i,e{l,K  +1} 
n n k n 


Cnpnlog(l/pn)]“  ' i^tl.K^l) 


For  given  k,  let  I^(r,m,s,s*)  denote  the  set  of  integers  r,m,v,s;’ 
which  satisfy 

0 < m < k 

max(0,m-2}  < r < max{m-l,0} 

r £ s <_  r + max{k-iv-l,0) 
s*  = max{(k-s),C^~ ■]} 

Then,  combining  (3.46)  and  (3.47)  we  obtain 


K +1  K +1  K +1 
n n n 


l l •••  I 1^! 

i,=l  L=1  i,  =1  12' 


1 - 2 


. (T(n) ) |a{ (?.  (n)) , 

' xk  X1  X1 


(i  -T.  ( n ) ) } 


ik  xk 


k-s-m+r  L s*-(m/2)  s*-(k/2)  -k/2  (m-k)/2 

<.  const  • max  { (K  +1)  (— ) (log  1/p  ) n p 

— n n n n 

Ik(r,m,s,s5-) 

for  k=3,4,...  (where  m represents  the  number  of  indices  (i^,...,ik) 
which  assume  values  1 or  k^+1;  r represents  the  number  of  equalities 
among  those  indices,  and  s and  s*  are  as  defined  above).  By  separ- 
ately considering  each  of  the  four  cases:  (1)  k even,  s >.  k/2; 

(2)  k even,  s < k/2;  (3)  k odd,  s >.  (k-l)/2;  and  (4)  k odd,  s < (k-l)/2, 
it  is  straightforward  to  show  that  the  right  hand  side  above  is 
asymptotically  maximized  by  the  following  parameter  settings: 


r 


m=r=0,  s=l,2,or  3 

if 

CO 

il 

^ k+1 

m=r+l=s+l=-2— 

if 

k>3 

and 

odd 

, k+2 

m=r+l=s+l=— 7T- 
v ^ 

if 

k>3 

and 

even 

in  which  cases  it  becomes 


log(l/pn) 

3~3 
n p 

* n 


1/2 


const. 


log( 1/ p ) 1/4 

I [ k-3  k+3  k+l] 

\ F.  n r\ 


L n p 
n rn 


rTk-2  k-2  k+2n 

LL  p n j 
n rn 


if  k=3 


if  k>3  and  odd 


-1/4 


if  k>3  and  even 


Successive  terms  of  the  above  expression  are  decreasing  in  k at  a 
-1/2 

rate  which  is  o(L  ),  while  for  k = 3 we  have 
n 


log (1/p  ) 1/2  L2  n(log  1/p  )1/2  L2 

r H_i  n rn  , n ^ 

n3n3  " 5/23/2  T.  = 01l/2  3/2' 

n P„  n P„  L n p 

n n n *n 

as  n tends  to  infinity.  Hence,  a further  application  of  (3.42) 

establishes  that  the  standard  deviation  of  S4(n)  is  o(c(n)). 

Using  (3.46)  and  the  formula  analogous  to  (3.43)  for  expectations, 

it  is  easily  shown  in  the  same  manner  that  the  expected  value  of  S4(n) 

is  also  o(o(n))  as  n ->■  Thus,  S.  (n)  is  o (o(n))  as  claimed.  This 

4 p 

completes  the  proof  of  lemma  3.7.  QED 

Combining  the  results  of  lemmas  3.5,  3.6,  and  3.7,  the  proof  of 
Theorem  3.1  is  complete. 

3.2.4  Discussion 

In  this  subsection,  we  explain  the  reason  for  the  definition  (3.5)  of 

the  quantity  discuss  the  possible  relationship  between  W(n)  and  W, 

• a 

including  the  introduction  of  a new  statistic  W:’:(n)  which  is  an  analog 

) 

of  W*  of  Shapiro  and  Francia  [36],  and  finally,  mention  • ne  practical 
considerations  in  choosing  the  parameters  of  our  selected  subset  of  order 
statistics. 

The  quantity  qR.  In  order  to  ensure  the  asymptotic  normality  of  our 
selected  subset  of  order  statistics,  it  was  necessary  to  have  the  "tail 
spacing"  (npfi)  of  the  statistics  different  from  the  "internal  spacing" 
(L^).  A consequence  of  this  fact  was  that  the  elemenis  of  the  inverse 
of  the  asymptotic  covariance  matrix  corresponding  to  the  tail  statistics 
were  too  large,  upsetting  the  "balance"  inherent  in  the  coefficients 
Y(n)  = V ^(n)T(n).  Specifically,  these  coefficients  were  asymptotically 
dominated  by  y^(n),  and,  in  fact,  Y'(n)y(n)  was  asymptotically  dominated 
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* 


'S 


k 


I 


\ 


2 

by  Thus,  for  = 1,  not  only  would  the  value  of  the  statistic 

ft(n)  be  essentially  determined  by  the  value  of  the  tail  statistics  Y^(n) 

and  Y^  +^(n),  but  the  statistic  itself  would  converge  to  zero  instead  of 
n 

one  as  n ■+■  «°,  in  which  case  the  ft-test  would  no  longer  be  consistent. 

1/2 

Even  defining  qn  = (np^/L^)  , its  limiting  form,  had  the  effect  of 

allowing  the  tail  terms  to  dominate  both  the  rate  of  convergence  of  S^(n) 
to  one  and  the  variance  of  S2(n).  Only  by  defining  qn  as  we  did  were 
we  able  to  cause  the  effect  of  the  tail  statistics  to  be  of  smaller  order 
than  the  combined  effect  of  the  remaining  K^-l  statistics. 

The  relationship  between  ft  and  W.  Probably  the  most  intriguing  and  cer- 
tainly the  most  difficult  question  raised  by  the  preceding  work  is  whether 
or  not  we  can  infer  anything  about  the  large  sample  behavior  of  W from 

f\j 

the  asymptotic  distribution  of  W under  the  null  hypothesis  (or  under 
the  alternatives  of  section  3.4).  In  essence,  does  the  form  of  the 
statistic  ft  properly  encode  the  information  about  the  null  hypothesis 
in  order  that  the  asymptotic  sufficiency  result  of  Weiss  [40]  may  be  ex- 
tended to  this  case?  If  we  were  to  take  p and  L to  be  their  values 

*n  n 

if  the  full  set  of  order  statistics  were  being  considered  (i.e. 

-2 

p = 1/n,  L =1),  then  the  variance  of  W(n)  would  be  of  order  n , 
n n 

which  is  what  has  been  conjectured  for  W by  Stephens  [33]  and  others. 
However,  we  of  course  cannot  do  that,  and,  in  fact,  condition  (3.4)  pre- 
cludes any  choice  of  the  parameters  which  would  make  the  variance  that 
small. 

On  the  other  hand,  let  us  define  an  analog  of  the  Shapiro-Francia 
» ^ 

statistic  W"  based  on  Y(n)  (w^th  q =1)  by 

n 


I 


~ ■ i 
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1 


I 


»*<„)  = 

CT'(n)T(n)3S2(n) 


(3.48) 


Also,  let  a*  denote  the  quantity 


r K +1  K +1 
L n n 


l l C(Vn)Ti(n)T-i^)  + (MO"1  - <S.,3v.,(n) 
i=l  j=l  n 1 3 n 13  13 


where  (v^^(n)}  are  given  by  (3.6)  (again  with  = 1).  In  the  proof  of 
lemma  3.7,  a“  was  shown  to  be  0(log  1/Pn)  as  n °°.  In  appendix  B we 
prove  the  following  theorem: 

Theorem  3.2.  Under  the  null  hypothesis, 

n[l  - W*(n)]  - a*  (3.49) 


converges  in  law  to 


T(v) , where 


k=3 


k 


iLtl 

i) 


is  the  distribution  of 


with  ^3 » ^ j • • • i. i.d.  standard  normal  random  variables. 

The  significance  of  Theorem  3.2  is  that  the  analog  W*(n)  of  the  Shapiro- 
Francia  statistic,  or,  more  precisely,  of  the  De  Wet-Venter  r^,  has  the 
same  asymptotic  distribution  (under  Hq  and  up  to  a shift  in  location)  as 
the  corresponding  statistic  based  on  the  full  set  of  order  statistics. 

Thus,  the  form  of  W (n)  does  properly  encode  the  information  about  the 


null  hypothesis.  We  can  conclude,  then,  that  if  ft(n)  does  not  behave 
like  W,  it  is  because  of  differences  between  the  rate  at  which  the  asymp- 
totic eigenvector  result  (lemma  3.4)  takes  hold  for  the  mean  vector  and 

<\j 

covariance  matrix  of  Y as  opposed  to  the  rate  for  the  corresponding  re- 
sult for  the  full  set  of  order  statistics.  Progress  toward  rigorously 
defining  the  relationship  between  W and  W might  proceed  along  that 
line  of  analysis. 

Practical  considerations.  In  practice,  there  are  two  ways  of  choosing 
the  "spacing"  parameter  L : 

(A)  L = n^  where  2/3  < 6 < ’ (such  that  (3.1)  - (3.4)  hold) 

n 

(B)  L = nr^n'>  where  r(n)  c as  n -»■  »,  2/3  < c < 1 (such  that 

n 

(3.1)  - (3.4)  hold). 

For  moderately  large  n,  method  (A)  leads  to  larger  values  of  Ln  and, 
hence,  smaller  values  of  than  method  (B).  The  advantage  ethod 

(A)  is  that  the  asymptotic  theory  takes  over  for  smaller  n.  ""  dis- 
advantages of  method  (A)  are:  (1)  Kn+1,  the  number  of  selected  order 

3 /4 

statistics,  is  very  small  (for  example,  for  Ln  = n and  n = 1000, 

<\j  # # 

(Kn+1)  = 5);  and  (2)  the  statistic  W(n)  is  too  sensitive  to  small  devi- 
ations in  the  tail  values.  Table  3.1  gives  the  asymptotic  means  and 
standard  deviations  for  fr(n)  for  small  changes  in  the  tail  percentiles 
Pn,  l-pn-  Small  changes  in  the  actual  values  of  the  tail  order  statistics 
should  also  have  dramatic  effects. 

Method  (B)  is  preferable  in  that  smaller  values  of  Ln  arise;  that 

'V 

is,  the  asymptotic  variance  of  W(n)  is  smaller  and  we  select  a larger 
subset  of  the  order  statistics.  The  problem  with  this  method  is  that  even 


TABLE  3JL 

ASYMPTOTIC  MEANS  AND  VARIANCES  OF  W(n) 
FOR  VARIOUS  CHOICES  OF  THE  TAIL  QUANTILES 


CASE* 


P(n) 

I 

II 

8(n) 

III 

8(n) 

n 

8(n) 

D(n) 

P(n) 

100 

.503 

.068 

.909 

.033 

.956 

.024 

150 

.544 

.052 

.928 

.024 

.967 

.017 

200 

.575 

.043 

.940 

.019 

.973 

.013 

250 

.599 

.037 

.947 

.015 

.977 

.011 

300 

.619 

.033 

.953 

.013 

! .980 

.009 

350 

.635 

.029 

.957 

.012 

,982 

.008 

400 

.650 

.027 

.961 

.010 

.984 

.007 

450 

.662 

.025 

.964 

.009 

.986 

.006 

500 

.673 

.023 

.966 

.009 

.987 

.006 

550 

.683 

.021 

.968 

.008 

.988 

.005 

600 

.692 

.020 

.970 

.007 

.988 

.005 

650 

.700 

.019 

.972 

.007 

.989 

.005 

700 

.707 

.018 

.973 

.006 

.990 

.004 

750 

.714 

.017 

.974 

.006 

.990 

.004 

800 

.720 

.016 

.975 

.006 

.991 

.004 

850 

.726 

.016 

.976 

.005 

.991 

.004 

900 

.732 

0.15 

.977 

.005 

.992 

.003 

950 

.737 

.015 

.978 

.005 

.992 

.003 

1000 

.741 

.014 

.979 

.005 

.992 

.003 

*Case 

L 

p 

n 

I 

n’75 

n~(l/log  n)1//2 

II 

•75 

n 

n-(l/log  n)2^3 

III 

•75 

n~(l/log  n)3/^ 

n 
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for  moderate  n,  significant  contributions  to  the  mean  and  variance  of 
$(n)  come  from  the  higher  order  terms  in  the  expansion  (3.19)  as  well 
as  the  lower  order  components  of  the  constant  and  linear  terms. 

From  the  practical  point  of  view,  therefore,  the  asymptotic  theory 
is  not  overly  useful  for  moderately  large  n in  actually  specifying 
the  critical  region  of  the  test.  It  is  useful  in  providing  possible  in- 
sight into  the  behavior  of  W,  in  indicating  how  to  choose  and  pn 

to  give  the  best  results  (i.e.  choose  L "small"  and  p "large" — which 

n rn 

f\, 

makes  sense  since  both  the  numerator  and  denominator  of  W(n)  have 

interpretations  as  estimators  of  a ),  and  especially,  in  indicating  how 
% 

W(n)  behaves  under  alternative  distributions  so  that  comparisons  with 
other  test  procedures  may  be  made.  The  latter  results  are  discussed  in 
the  remainder  of  this  chapter. 

3.3  Consistency  of  the  W-Test 

In  this  section  we  show  that  as  n »,  the  power  of  the  sequence  of 
tests  based  on  W(n)  tends  to  one  against  any  fixed  alternative.  Through- 
out this  section  we  assume  that  for  each  n,  the  sample  X^,...,Xn  comes 
from  the  distribution  H(x),  where  H(x)  = $((x-y)/a)  for  any  (p,cr) 
with  a > 0.  In  particular,  we  assume  that  there  exists  some  e > 0 such 
that 

inf  sup  |H  1(p)  - ci>_1(p)  - d|  > e 

c,d  0<p<l  (3.^0) 

c>0 

where  H \p)  is  defined  as  the  smallest  value  of  x such  that  H(x)  >_  p. 
(Note  that  the  degenerate  case  where  the  support  of  H(x)  is  a single  point 


la 


is  ruled  out  by  this  assumption.)  Defining 
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r 


n(x)  = < 


x|6H(x) 


x (l-H(x) ) 


if  x<p 
if  x>0 


we  further  assume  that  n(x)  is  bounded  and  converges  to  zero  as  |x|  -*■  » 
for  some  6 > 0.4.  We  remark  at  the  end  of  this  section  how  this  regu- 
larity condition  may  be  relaxed.  A sufficient  condition  that  it  hold 

6 

(see,  e.g.  Sen  [23])  is  that  E^{|x|0}  is  finite.  Our  proof  of  consis- 
tency requires  the  following  lemma: 


Lemma  3.8.  There  are  points  p* , p* , p* , with  0 < pf  < 1 for  i=l,2.,3, 
non- overlapping  open  intervals  If  = (pf-6*,pf+fi*)  with  6*  > 0, 
and  some  e*  > 0 such  that  for  any  (c,d)  with  c > 0 , 


inf|H  \p)  - c$  ^(p)  - d|  > e*  (3.51) 

pelf 


for  at  least  one  i=l,2,3. 

Proof.  We  prove  the  result  when  H(x)  is  discrete.  The  modifications 
required  when  H(x)  is  continuous  are  pointed  out  parenthetically  as 
we  proceed. 

Let  x^  and  (x^  < x^)  be  any  two  jump  points  of  H(x)  and 

choose  pf,  i=l,2,  such  that  H(x^-)  < pf  < H(x^),  where  H(x-)  denotes 
the  limit  from  the  left.  (In  the  continuous  case,  we  choose  pf  such 
that  0 < pf  < p*  < 1 and  H \p)  is  strictly  increasing  for  p in 

A /V 

some  neighborhood  of  pf.)  Let  c and  d be  the  uniquely  determined 


*’B\n  ^ 
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values  such  that  <K(x^-d)/c)  = p'£  for  i=l,2.  Then,  from  (3.50),  there 
exists  p*,  0 < p*  < 1,  such  that 


| H 1(p*)  - c4>  1(p‘g) 


d|  > e . 


I : 

5 

i 3 


i 

s 


A 2 A 

Furthermore,  the  continuity  of  c4>  (p)  + d implies  that  we  can  "shift" 

p*  slightly  if  necessary  so  that  H(Xg-)  < p*  < H(Xg)  for  some  jump  point 
x^  of  H(x),  possibly  at  the  expense  of  replacing  e by  a smaller 
positive  quantity  e'  (say).  (In  the  continuous  case,  we  can  have  H ^(p) 
strictly  increasing  in  a neighborhood  of  p* . ) 

Now  consider 

inf  max  |h  ^(pv)  - c4>  ^(x.)  - rtj. 
c,d  i=l,2 ,3  1 1 

c>0 

By  the  way  the  p|  were  chosen,  the  value  of  this  quantity  must  be  strictly 
positive.  We  will  denote  the  value  by  2e*. 

We  now  show  how  the  intervals  Iv  are  determined.  We  consider  only 
the  case  where  x1  < x2  < and  1/2  < p*  < p*  < 1.  The  other  cases  are 
similar.  The  situation  is  depicted  in  Figure  3.2.  The  shaded  boxes  are 
the  regions  where  (3.51)  is  satisfied;  that  is,  (3.51)  holds  whenever  the 
curve  <K(x-d)/c)  passes  through  no  more  than  two  of  the  three  boxes.  By 
the  definition  of  e*,  any  normal  curve  must  pass  through  or  outside  at 
least  one  of  the  points  (x^+e;’{,p^) , i=l,2,  or  3.  These  points  are  denoted 
by  * in  the  figure.  Since  the  slope  of  <S>((x-d)/c)  is  decreasing  in  x, 
the  worst  possible  case  is  that  c and  d satisfy 
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x - ( e*/2 ) - d 

<5>( ) = p*  + 6* 

c 1 


X - e«  - d 

*( — 3 — } = P3 


Adding  the  additional  equation 


x3  - (e*/2)  - d 36* 

*( 5 } = P3  + ~ 


solving  for  6*,  and  taking  <5v  = 6*,  i=l,2,3,  thus  yields  the  desired 
intervals.  (In  the  continuous  case,  the  shaded  boxes  become  cylinders  of 
width  e*  about  the  curve  H(x)  and  the  slope  of  H(x)  in  each  of  the 
neighborhoods  must,  therefore,  also  enter  the  calculation,  but  the  method 
is  the  same.)  This  completes  the  proof  of  lemma  3.8.  QED 

Now  let  J'v(n),  i=l,2.3,  denote  the  set 

{ j | pv  - <5*/2  £ Pn  + (j-l)Ln/n  4 pv  + 6*/2}. 


Note  that  |jv(n)|  = largest  integer  <.  6*n/L  . We  now  show  that  (3.51) 
implies  that  for  at  least  one  i=l,2,3, 


Lim  P{  min  | Y . - c^.(n)  - d|  >.  e*}  = 

n-x»  jeJ5)(n)  ^ 


(3.52) 


First  assume  that  H(x)  is  discrete.  By  the  way  the  pv  were  chosen,  we 
may  assume  without  loss  of  generality  that  H(x^~)  < pv  - 6*/2  < pv  + 6*/2  < 
H(x^)  (since  otherwise  we  just  replace  Iv  by  its  intersection  with 
(H(x^-),H(x^))).  Thus,  je  Jv(n)  implies  Y.  = x^  with  probability 
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approaching  one  as  n -»■  ».  Thus,  (3.52)  follows  immediately  from  (3.51) 
for  Lhis  case.  Now  assume  that  H(x)  is  continuous.  We  may  again  assume 
without  loss  of  generality  that  H ^(p)  is  strictly  increasing  on  Iv. 

Lot  M^(n)  denote  the  quantity 


v£(H(Y.) 


nP. 


(5-Di*, 


j e J5)(n). 


It  is  well  known  that  max  I M . . (n ) I 

jej£(n)  13 

Writing 


is  finite  with  probability  one. 


Y. 

D 


I M..(n)  np  +( j-l)L 
= H-1(— + — 2 H) 


n 


we  see  that  je  Jv(n)  implies  that  the  argument  of  H 1 above  is  in  Iv 
with  probability  approaching  one  as  n increases.  Applying  (3.51),  we 
again  obtain  (3.52). 

1/2 

For  i=l,2,. . . ,K  +1,  let  Y‘£(n)  denote  the  quantity  y^iO/Cy*  (n)Y (h)] 

i\j 

(where  y(n)  = T’(n)V  (n)  as  previously  defined).  Consider  the  quantity 


K +1 

“l  M - un(^,  - ?.)]2 

j=l  3 3 


(3.53) 


where  u is  defined  as 
n 


( 


K +1  K +1 


(3.53) 


It  is  easily  seen  that  (3.53)  is  equal  to  1 - W(n).  In  lemma  3.4  it  was 

'v#  -1/2 

shown  that  Y?(n)  = T.(n)(l+0(L  /np  ))(K  +1)  : hence  (3.53)  can  also 

j ] n n n 


i 
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be  written  as 


u2  "l  [T.(n)(l+0(I>n/npn))(u^(Kn+l)~*/2)  - * ?.]2. 

j=l 


Applying  lemma  3.8. 


1 - W(n)  >,  u2  l l [^.(n)(l+0(Ln/npn))(u^(Kn+l)  1/2  " Yj  + Y-^ 
n i=l  jeJv  3 


>,  6*(e*)2(nu2/Ln) 


with  probability  approaching  1 as  n ■*  °°.  Also, 


(nu2/L  ) > (K  +l)u2  = W(n)/S2(n) 
n n n n 


'V/O 

with  S (n)  as  defined  in  section  3.2.  Thus, 


1 - ft(n)  > 2 6“(ei;-- 2 

“ §2(n)  + 6*  (e*) 


(3.54) 


with  probability  approaching  one  as  n + “.  Since  the  sequence  of  critical 

points  for  the  W-test  approaches  one  at  the  rate  n pn/(Lnlog(l/pn) ) (see 

lemma  3.5  and  Figure  3.1),  consistency  will  follow  immediately  once  we 

establish  that  S2(n)  is  op(n4pJ/(L4log(l/pn) ) ) as  n 

We  now  return  to  the  function  q(x).  Our  initial  assumption  implies 

that  n(Yj  is  bounded  for  1 < i < Kn+1*  Using  this  fact  together  with 

the  fact  that  /ntH^Y^.)  - H(Y\ ))  is  finite  with  probability  one  (where 

H (x)  is  the  empirical  c.d.f.  and  H(x)  may  be  discrete  or  continuous), 
n 


I 

! 


L10 


it  follows  that  the  function 


5(p  +(i-l)L  /n) 
rn  n 

5(l-p  -(i-l)L  /n) 
n n 


if  p +(i-l)L  /n  <.  H(0) 
n n — 

if  p +(i-l)L  /n  > H(0) 
rn  n 


(3.55) 


is  finite  with  probability  one.  If  H(x)  is  continuous,  (3.55)  is  immedi- 

-1/2  -1 

ately  implied  by  the  fact  that  n pn  converges  to  zero  as  n 00 . 

If  H(x)  is  discrete  and  <.  0,  then  H^(Y^)  <,  pn+(i-l)lWn,  from  which 
the  first  half  of  (3.55)  follows.  Recalling  our  assumption  on  the  tails 

t \j  'b 

of  H(x),  for  Y.  > 0 and  H(x)  discrete,  l-pn-(i-l)Ln/n  = Hn(Y^)  + 67(n), 

'Xi 

where  6?(n)  is  O(Y^)  and,  in  fact,  converges  to  zero  as  the  sample 

r\j 

quantile  Y^  assumes  arbitrarily  large  values.  Thus,  (3.55)  is  established 
and  for  any  3 > 0,  we  can  find  a constant  M(g)  such  that  (expression 
(3.55))  < M(8)>  >1-8. 

Finally,  with  probability  greater  than  1-8, 


'X’r 


K +1 

-1  "v  (V2 


S^(n)  < (Kn +1)"  l i. 
— n » — — i 

i=l 


“FTT  .1  . I 

n i=l  i=i'*+l 


/n] 


-2/6 


(where  i*  is  defined  by  pn+(i*-l)Ln/n  < H(0)  < Pn+i*Ln/n) 


< 2- 


H(g) 

K +1 
n 


i*+l  O /X 

l Cpn+(i-l)Ln/n]“2/6 


< 2M(8)p^2/<S  . 


— 


Ill 


But,  using  (3.2), 


Lnl°g(l/Pn)  _2/5 

4 3 pn 

n Pr, 


n 


4 8,,  , , ,4 

n p (log  l/pn) 


5-(2/6), , , , J 

> (log  1/p  ) 


which  converges  to  zero  for  <5  > 0.4.  Thus,  since  B can  be  made  arbi- 
trarily small,  £p(n)  is  o (n4p^/(L4log(l/p  )))  as  required. 

p n n “ n 

Summarizing  the  above,  we  have  proved 

Theorem  3.3.  Under  the  above  assumptions,  the  ft- test  is  consistent . 

We  remark  that  the  same  proof  immediately  yields  the  consistency  of 

the  test  based  on  W*.  The  only  change  above  is  that  Y‘£(n)  = T^(n).  Also, 

the  restriction  6 > 0.4  can  be  relaxed  by  placing  tighter  controls  on  the 

tail  quantiles  (pn  and  1-p^).  For  example,  if  instead  of  (3.2)  we 

assume  that  lim  L /np4  = 0,  then  theorem  3.3  follows  for  6 > 2/9,  and 
n-**>  n rn 

so  forth. 

<v> 

3.4  The  Asymptotic  Power  of  the  W-Test 

We  wish  to  develop  a means  of  comparing  the  W-test  with  other  known 
tests  for  normality.  As  in  the  previous  chapter,  we  do  so  by  defining  a 
general  class  of  alternatives  which  approaches  <Kx)  as  n increases 
and  determining  the  exact  rate  and  character  of  approach  which  guarantees 
a non- trivial  power. 

The  sequence  of  alternative  distributions  which  we  will  study  has 
the  form 


H (x)  = <5>(x)  + e (x)  (3.56) 

n n 

where  the  disturbance  functions  e (x)  satisfy  the  following  conditions: 

n 


s”  (x)  exists  and  |e”(x)|  < D < °»  for  all  x e (-“>“)  (3.57) 

-<5>(x)  < e (x)  < 1 - 4>(x)  for  all  x e (-»,»)  (3.58) 

— n — 


f e1 (x)dx  = 0 
n 

-00 


(3.59) 


e^(x)  >.  -4>( x ) for  all  x e (-“,“).  (3.60) 


The  above  definition  and  assumptions  are  (2.32)  - (2.37)  specialized  to 


the  normal  case. 


Denote  by  p(n,en)  the  quantity 


•n  e (T(x)) 


/ c-  W - C M 


n e (T(x) ) 2 


1’pi>  T(x)  2 

1/  igk1™*" 


t i(^,2(1/PV(>0dxTnO3£n(T(x)Mx  -1|P"  iH.1/"  £&„<«*»«• 

P P P„  t (x)  p 

*11  Fn  n n 


Then,  in  order  to  guarantee  a power  of  the  W-test  between  the  level  a 
and  1,  we  require: 


5/2  3/2 

n pn 

Lim  5 p(n,e  ) 

n-*°  ^2  n 

n 


(3.61) 


for  some  constant  d*,  0 < d*  < »,  and,  letting 


M i 


C(n)  = max{ 


L^(log  1/p  )3  n3/V/2log(l/p  ) 
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en(T(x)) 

Lim  b(n)  sut>  |— — ! 


P <X<.L-p 

*n  *n 


(3.62) 


L^m  b(n)(log  1/p  ) sup  |— ^ 

Pn<X<1"Pn 


e'(T(x)) 

JW~I  = o 


(3.63) 


for  some  increasing  sequence  b(n)  such  that 


Lim  b(n)  = °°,  Lim  C(n)b  (n)  = 0. 
n-**-  ’ n-^oo 


The  motivation  for  the  condition  (3.61)  will  become  evident  in  the 
course  of  the  derivation  of  the  asymptotic  power  of  the  test.  The 
quantity  p(n,en)  is  what  might  be  termed  the  "characteristic  metric" 
for  the  test;  the  relation  (3.61)  requires  that  the  sequence  (H^tx)} 
converge  to  $(x)  according  to  this  metric  at  a rate  proportional  to 
the  asymptotic  standard  deviation  of  v/(n ) under  the  null  hypothesis. 
Heuristically,  the  conditions  (3.62)  and  (3.63)  insure  that  the  alterna- 
tive distributions  do  not  oscillate  too  badly  in  the  interval  (pn,l-pn). 
When  (3.61)  holds,  (3.62)  and  (3.63)  will  also  hold  in  most  cases,  although 
an  additional  restriction  on  the  parameters  ^n’^n,pn^  mav  resu^’t' 

The  remainder  of  this  section  is  devoted  to  proving  the  following 
theorem : 


Theorem  3.4.  Under  the  sequence  of  distributions  (3.56) , the  asymptotic 

_ i /a 

power  of  the  w-test  is  given  by  $((27/2)  ' d*  + c*  „). 

5 a 

We  divide  the  proof  into  two  lemmas.  The  first  introduces  a sequence 
of  distributions  {ff  (x)}  which  will  simplify  computations  in  the  sequel. 

The  second  contains  the  asymptotic  distribution  of  W(.n)  under  the  sequence 
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of  alternatives. 


Denote  the  function  T(x)  - e (T(x))/t(x)  (p  < x < 1-p  ) by  B (x), 

n n = — n n 

with  derivative 


, e (?(x) )T(x)  e' (T(x) ) 

BA<X>  ■ * (x>[1  - TKJ 


o. 


Also,  let  ^(x)  denote  the  function  given  by 


^ aln(x) 


0 < x < p. 


n 


An(x)  = < Bn(x) 


< x < 1-p 
rn  *n 


, a.  (x)  1-p  < x < p 

Vv,  2n  *n  *n 


where  a^n(x)  and  a0ri(x)  are  continuous,  differentiable  functions  which 


2n 


satisfy 


aln(pn>  = W 


?2n(1-pn)  = B„(1-P„) 


aL<Pn>  = B'n{*n) 


a'  (1-p  ) = B' (1-p  ) 
2n  *n  n *n 


Lim  a,  (x)  = -» 
v,,  ln 
x->0 


Lim  a„  (x)  = +«> 
2n 
x-+l 


alntx)  ^a2n^x^  strictly  increasing  on  (0^)  ( (l-p^,!)  ). 


We  will  not  need  explicit  expressions  for  aln(x)  and  a2n^x^’  ^ 


is  clear  that  such  functions  exist. 


Thus,  Afi(x)  approaches  -®  as  x ■+  0 and  +oo  as  x ■+  1.  Further- 


more, by  (3.62)  and  (3.63),  there  exists  some  N such  that  for  all 

o 
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n > N , B (x)  (and  hence  A (x))  is  strictly  increasing  on  [p  ,1-p  3 
on  n n n 

((0,1)).  Since  we  are  concerned  only  with  asymptotic  results,  we  lose 
no  generality  by  assuming  that  n > Nq,  and  we  do  so  from  now  on.  In 
particular,  we  can  define  an  absolutely  continuous  c.d.f.  H^ty)  by 
specifying  its  inverse: 


H 1(x)  = A (x)  (0  < x < 1). 

n n 


We  denote  the  corresponding  density  by  h^(y). 


Lemma 


a 3.9.  Under  the  above  assumptions,  {H.^(y)}  and  { ( y ) } are 
asymptotical] y indistinguishable . 


Proof.  Let  { Y } be  a sequence  of  random  variables  with  c.d.f.'s  {H  (y)}. 
n n 

We  must  show  that  h (Y  )/h  (Y  ) converges  stochastically  to  1 as  n «>. 

n n n n 

For  each  n,  let  X be  a random  variable  with  distribution  H (A  (x)), 
n n n 

0 < x < 1.  Then,  the  random  variable  Y*  = A (X  ) has  c.d.f.  H (x): 

n n n n 

thus  we  have  the  equivalent  condition  that  \( An(X^) )/hn(An(Xn))  converges 
stochastically  to  1 as  n -»■  00 . 

Let  En(n)  denote  the  event  {p  < X < l-p  } and  E.(n)  denote 
1 *n  n rn  2 

its  complement.  It  is  easily  shown  that  P„  -r  (E  (n)}  conyerges  to  1 as 

ii  n 

n Given  E (n),  h (A  (X  ))  = [¥'  (X  )3_1  and 

x n n n n n 


.*W)r,  *n(T(V>T<V 

r fj  fy  u too  " t(x  ) ~ t(X  ) JL1  + -— — —3. 

hnan(Xn)}  n n n *(VXn}) 


Assumptions  (3.62)  and  (3.63)  imply  that  the  expressions  in  brackets 
converge  to  1 as  n -»•  “.  Furthermore, 
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<KA  (X  )) 


n'  n' ' , _ J2.,v  ^ t2/-v  \ - D D_  _ r_ii “ — ] 

L j_/v  \ J 


21cg(  -%Y>  = ^(X„)  - A'(Xn)  = 2 ^ 

n n 


e_(T(X_))T(.X_)  2 


t(X„) 


when  E1(n)  occurs,  and  the  latter  expression  converges  to  zero  as 

n -*■  00 . Thus,  let  {e  ; m=l,2,...}  be  any  null  sequence.  Then,  the  above 

m 

arguments  imply  that  for  each  m,  there  is  some  such  that  if  n > N^, 


h_(A_(Xj) 

i - — z 1 < em  I E^n)}  = !• 


P - { } 1 n n n 
HA  1 


n n h (A  (X  )) 
n n n 


Finally,  let  {6^}  be  another  null  sequence;  let  Nm  by  such  that  n>N 


m 


implies  P„  -r  {E, (n)}  > 1-6  . Then,  n > max{N  ,N  ,N  } implies 
v H_A  1 m = o’  m’  m r 

h (I  (X_ )) 


n n 


Pu  T { 1 1 - - J < e } > 1 - 5 . 

HA1  r ■ fT  fv  m m 

n n h ( A ( X ) ) 

n n n 


VJe  have  shown  that  h (A  (X  ))/h  (A  (X  ))  converges  stochastically  to  1 

nnnnnn 


as  desired. 


QED 


Thus,  for  all  asymptotic  probability  calculations,  H^Cx)  can  be  used 
in  place  of  H^Cx).  This  will  lead  to  more  useful  expressions  from  a 
computational  standpoint. 

Let  T(n)  denote  the  (K^+D-vector  whose  elements  T^(n)  are  given 

by 


T.  (n)  = 


V,n’1<Pn*(i-1)-ir> 


H _1(p  *t(  i— 1 ) ”) 
n *n  n 


i=l,K  +1 
n 


i=2 ,3 , . . . ,K 


A. 


. ^ S.iMl  i ii 
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and  let  t(n)  denote  the  (K  +l)-vector  with  elements  t.(n)  = 

n 1 


h (H  ^(p  +(i-l)L  /n)),  i=l,2,...,K  +1.  Also,  let  e(n)  denote  the 
n n n n n 


(K  +l)-vector  with  elements 
n 


r W1*)) 

qn  t.(n) 
i 


i=l,K  +1 
5 n 


ri(n)  -< 


e (T.(n)) 
n i 


t^(n) 


1=2 ,3, . • . ,K 


V. 


Note  that  T(n)  = T(n)  - e(n).  Finally,  let  I,.  ...  denote  the 

1\  T X 

n 


(K  +l)x(K  +1)  identity  matrix,  - denote  the  (K  +l)x(K  +1)  matrix 
n n K.  +x  n n 

n 


all  of  whose  elements  are  1,  and  M(n)  denote  the  matrix  [I,.  , . + 

1\  +X 


v-l. 


n 


(K  +1)  ‘ J with  rows  m.(n)  and  (i,j)th  elements  m..(n),  i,j  = 

n K tX  i i*] 

n J 


1, . . . ,K  +1. 
n 


Lemma  3.10.  Under  the  alternatives  (3.56)  subject  to  (3.57)  - (3.63), 


. 5/23/2 

1/7  n p, 


(— ) C 

K 2>  L 9 


n (fr(n)  - y(n))] 


n 


1/2  , 

is  asymptotically  normally  distributed  with  mean  -(27/2)  d5t 


and  variance  1. 


Proof.  Let  Sn(n)  denote  the  quantity 


[?'(n)V_1(n)T(n)]2 


CT' (n)v"1(n)V"1(n)T(n)][T, (n)M(n)T(n)] 


and  let  S^(n)  = W(n)  - S^(n).  Thus,  S^(n)  is  the  constant  term,  and 


8^(0)  represents  the  remaining  terms  in  a Taylor  series  expansion  of 
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$(n)  about  the  point  T(n).  We  first  consider  S^(n). 
S^(n)  - S^(n)  by  A^,  we  have 


Denoting 


j = CY'(n)T(n)]2 Cy' (n)T(n)]2 

n Y1 (n)Y(n)T' (n)T(n)  y' (n>Y(n)T' (n)M(n)T(n) 


Recalling  that  Y1  (n)  = 2(Ln-l)T'(n)  + e'(n),  T'(n)T(n)  = (n/L^Kl  + 

0(p  log(l/p  )))  and  Y'(n)Y(n)  = 4(L  -l)2(n/L  )(l+o(l)),  we  have 
n n n n 


Y' (n)Y(n)T* (n)T(n)T* (n)M(n)T(n)  = 4(Ln-l)2(n/Ln)^(l+o(l) ) , 
and,  neglecting  smaller  order  terms,  Afi  can  be  written  as 


e'(n)e(n)  + L-1e' (n)e(n)  + (2nL  ) ^Ce' (n)T(n)]£e' (n)e(n)] 
n n 


n I + (2n)  2[e' (n)e(n)][e' (n)T(n)]2 + n 1[e' (n)T(n)][e' (n)e(n)]  J 


A* 

j Ce'CnAnJDCe'CnJ^n)]  + (2n)'1[e'(n)T(n)]2[e'(nAn)] 
n L_1[e' (n)T(n)]2  + [e' (n)T(n)][e' (n)e(n)]  + (4L  ) \e' (n)e(n)]2 


L2  Kn+1 

- l T.(n)]2(l  + n'VtnjTti.)  + (2n)"2[e' (n)T(n)D2}. 

^ , i 

n i=l 


Finally,  using  the  definitions  of  e(n)  and  e(n),  approximating  sums  by 
integrals,  and  neglecting  smaller  order  terms,  it  is  easily  shown  that 


4n  = P(n,en). 


We  next  consider  82(0).  To  do  so,  we  return  to  the  notation  of  lemma 


L 


t 


t 
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3.7.  We  will  first  show  that 


»(k! 

V2' 


. (T(n) ) = 

k 


Jk> 

w.  . 

h-V 


.1. 


. (^(n) ) (l+o(l)) 


(3.64) 


as  n ■+  00  for  all  k=l,2,...  and  all  indices  i,  e {1,2,...,K  +1}.  By 

k n 

(3.45),  (3.64)  follows  immediately  from  the  following  relations: 


A' (T(n) ) A! (T(n) ) 

— = _i_ (i+o  (l)) 

A(T(n) ) A(T(n)) 


A'.' . (T(n) ) A'.’.(?(n)) 

(l+o(D) 

A(T(n))  A(T(n) ) 


B'.(T(n))  B'.(?(n)) 

— = (l+o(l)) 

B(T(n) ) B(T(n) ) 

(3.65) 

B'.'.(T(n))  B'Mfyn)) 

_i3_ = _ii (l+o(l) ) 

B(T(n) ) B(T(n) ) 


as  n -+  » for  i,  j = 1,2, . . . ,Kn+l. 

To  prove  (3.65),  we  first  consider  the  quantity  Y'(n)e(n),  which 
can  be  written  for  asymptotic  purposes  as 


i-Pr 


2n  J 

P. 


T(x) 


t(x)  n 


•e  (T(x))dx  + 


L 

n 

3n 


n 


2 1-^r 

J 

P« 


T3(x) 


t3(x)  n 


e (T(x))dx. 


Applying  the  mean  value  theorem  for  integrals  (e.g.  Bartle  Cl, p.327]), 

— -1  1/2 

(3.62),  (C.8)  and  (C.10),  we  have  Y'(n)e(n)  = o(nb  (n)(log  l/pfi)  ) 
as  n ■+  00 . Using  this  fact, 

Al(f(n))  2yi(n)Y,(n)T(n) 

A(T(n) ) C Y 1 (n)T(n)32 

AI  (T(n) ) Y-(n)Y' (n)e’(n)  y'(n)T(n)  2 

= [_± i ][ ] 

A(T(n) ) [ Y* (n)T(n)]2  Y'(n)T(n) 


A 
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,<v» 


A^(T(n) ) Lnlog(l/pn) 


-1/ 


,1/2, 


[■— r + o (-u'm  “ )][1  + o(b  (n)(log  l/pnr#‘)] 

A(T(n) ) nMn) 


*!<*<"»  -1  1/2 
— — [1  + o(b  (n)(log  1/p  ) / )], 

A(T(n) ) n 


where  the  last  equality  follows  from  (3.44).  The  remaining  relations  in 
(3.65)  are  similarly  established  and  (3.64)  follows.  Next  we  note  that 
since 


ti(n)  = ti(n)[l 


en(Ti(n))Ti(n)  e^(T.(n)) 


ti(n) 


'T'tn) — = ti^n^l  + <5(i»n)] 


where  S(i,n)  converges  to  zero  as  n » uniformly  for  1 < i <.  K^+l, 


<\j  <\j 

the  covariance  matrix  of  +^)  under  the  sequence  of  alternatives 

n 


(Hn(x)}  may  be  taken  to  be  V(n)  as  defined  in  (3.6)  for  all  asymptotic 


purposes.  Thus,  we  have  shown  that 


E(S2(n)}  = E{S2(n)  + S3(n)}(l+o(l))  = o (a(n)). 


and 


'w 


o{S2(n)}  = a{S2(n)  + S3(n)}(l+o(l))  = a(n)(l+o(n)), 


Thus,  under  the  sequence  of  alternatives  (Hn(x)},  the  quantity 


o 1(n)CW(n)  - S^(n)]  = a 1(n)[W(n)  - y(n)3  + a 1(n)An 


is  asymptotically  distributed  as  standard  normal.  Applying  (3.61)  and 
lemma  3.9,  the  proof  of  lemma  3.10  is  complete.  QED 


-»■ 


121 


Theorem  3.4  follows  immediately  from  lemma  3.10. 

3.5  Extensions 

The  results  of  section  3.4  are  easily  extended  to  the  statistic  W*(n), 
as  summarized  in 


Corol2ai*y  3.1.  Assume  (3.56)  through  (3.60)  hold  as  well  as  n 

L 

(i)  (3.62  and  (3.63)  hold  with  C(n)  replaced  by  C*(n)  = —37^' -3 /'g C(o) 

n Pn 

(ii)  Lim  np*(n,e  ) = ds),  0 < d*  < ® 

* n 1*  1 

n-*° 


where 


■1-P. 


P*(n,en) 


n e (T(x))  2 


1-P 


n e (T(x) ) 2 


= 1 c "too  ] dx  - { / ^00 — dx} 


■ n 


1-P. 


Then,  n(l  - W*(n))  - a*  converges  in  law  to  T(y  - d*)  as.  n ■>  °°. 

Proof.  Using  lemma  3.9  (with  = 1)  and  expanding  W*(n)  in  a Taylor 

series  about  T(n),  the  constant  term  in  the  expansion  is  given  by 

1 - p*(n,en)  as  n “.  The  methods  used  in  the  ^of  of  lemma  3.10  apply 

immediately  to  the  quadratic  and  higher  order  terms,  yielding  the  fact 

that  the  differences  between  these  terms  in  the  expansions  under  Hq  and 

are  o^d/n).  To  see  that  the  linear  term  in  the  expansion  is  also 

Op(l/n),  we  use  (3.24)  (with  y(n)  replaced  by  T(n))  together  with  assump- 

Ml)  

tion  (i)  above  to  obtain  the  result  that  w.  (T(n))  is 

i 

3/2 

0(Ln(log  l/pn)  /nb(n))  as  n -*■  “.  Hence,  using  (3.16),  the  variance 

3 2 

of  the  linear  term  is  of  smaller  order  than  (log  1/Pn)  /nb  (n),  which  is 
2 

0(l/n  ).  Applying  Theorem  3.2,  the  proof  of  the  corollary  is  complete.  QED 
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We  remark  that  the  Schwarz  inequality  implies  that  the  "metric'* 
p*(n,en)  is  always  positive.  The  same  cannot  always  be  said  to  hold  for 
p(n,en>.  In  fact,  the  ?/-test  can  have  very  interesting  power  functions, 
as  illustrated  in  example  3.2  below.  The  next  result,  however,  gives 
one  example  of  a sufficient  condition  for  p(n,en)  = p*(n,en). 


Corollary  3.2.  Assume  that  (3.56)  through  (3.63)  hold  as  well  as 


(A)  Lim  L p /n  =0 
, n n 
n-*» 


(B) 


e (T(x))  00 

' is  of  the  form  £ c (n)T  (*) 

m=0 


t(x) 


where  the  constants  (cm(n)}  satisfy 


cm(n)  = 0(cm_1(n)pn)  111=1,2,, 


Then, 


p(n,en)  = p*(n,ep)  = 2c0(n)pn(l+o(l) ) 


as  n ->■  00 . 


Proof.  Let 


.(1)  - 


r2  1-p  . . 1-p 

, L 00  rn  ™3+m,,  . rn  0 

Vfdx/  T(x,ax 

n m=0  pn  t (x)  pn 


1-P. 


- J 


n T4(x) 


pn  t (x)  pn 


dx  j T1+m(x)dx] 


L2  « 


(log  l/p_) 


m+1 


= 3 2 ^ °2m+l(n)  p 

n m=l 


n 


-(1  + 0(pnlog(l/pn))) 


= °(  — cQ(n)  (pnlog  l/pn)  ) 
n 


~y  f,*7r 


-v  ‘ g-jy?’ 


.^4 
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since  the  expression  in  brackets  is  identically  zero  for  m = 0,1,2  and 
all  even  integers  thereafter.  Note  that 


Jn  e (T(x) ) 


/ [-nt(xT  1 dX  = c0(n)(l-2pn+0(pn)) 


' n e (T(x))  2 


{ / ~t(xT"  dx}  = c0(n)((l-2pn)  +0(pn)) 
Pr.  ' ' 


T(x)_  _ 2. 


{ / nffen(T(x))dx}  = ^(n)(l+0(px))  = cj(n)0(pj). 
Pn 


Combining  the  above  and  neglecting  smaller  order  terms,  we  see  that 
2(1)  2 

p(n,en)  = 2cQpn  + p , and  p*(n,en)  = c^p^.  Finally,  assume  that 


n5/2  3/2 

± * f 


This  imples  that  c.(n)  = 0(L  /(np  ) ' ',  so  that 

u n n 


5/2  3/2  , 9/4,,  , . ,2 

" pn  (1)  „,Lnpn  (lc«  1/pn>  

2 P 3/4 ) = ^ ^ 

L n 

n 

by  assumption  (A).  Thus,  under  (3.61),  p^  is  of  smaller  order  than 

2 2 
o0(n)pn,  and  p(n,en)  = p*(n,en)  = 2c0(n)pn(l+o(l) ) as  desired.  QED 

It  is  clear  from  the  form  of  the  power  function  of  the  W-test  that 
the  rate  of  approach  of  alternatives  to  the  null  hypothesis  which  yields 
a nontrivial  power  increases  as  the  order  of  L decreases.  Hence, 
assymption  (A)  is  quite  reasonable.  Examples  3.1  and  3.2  below  illustrate 
assumption  (B)  when  (A)  does  and  does  not  hold,  respectively. 
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i. 


Before  turning  to  the  examples,  we  introduce  two  additional  statistics 


motivated  by  the  results  above.  In  both  p(n,en)  and  p5!(n,en),  the  term 


-W  ~ 


n e (T(x) ) 2 


<\,2  ( 

arose  because  D (n)  was  used  in  the  demoninators  of  W(n)  and  w‘,e(n). 


^>2  'v  ^ 

In  particular,  we  found  that  replacing  D (n)  by  Y'Y  results  in  the 


elimination  of  precisely  the  above  term  from  each  of  the  two  metrics. 


That,  of  course,  does  not  guarantee  that  the  tests  based  on  the  modified 


statistics  will  have  improved  power,  since  the  distributions  of  the 


statistics  under  the  null  hypothesis,  and  hence  the  critical  regions  of 


the  tests,  would  be  expected  to  change  also.  However,  it  turns  out  that 


the  asymptotic  distribution  of  the  modified  W(n)  does  not  change,  and 


the  distribution  of  the  modified  W*(n)  changes  only  slightly. 


Formally,  we  define  the  following  two  statistics: 


W(n)  = 


[?' (n)V_1(n)ir 


CT’(n)v'1(n)v'1(n)^(n)]?'^ 


W*(n)  = U 


CT'(n)T(n)]?'? 


The  asymptotic  distributions  of  these  statistics  under  the  null  and 
alternative  hypotheses  are  summarized  in  the  last  two  corollaries  of 


this  section. 


- — cZaMJsm  ^ <-«*■■** 
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Corollary  3.3.  Under  , 

, 5/2  3/2 

1/2  n p ^ 

(i)  &)  — (Sf(n)  - u(n) ) converges  in  law  to  $(x) 

2 L 

n 

(ii)  n(l  - ( n ) ) - a**  converges  in  law  to  ^(y),  where 

Kn+1  Kn+1  L 

* -5  "I  XC^Ti(n)Ti<n>  ' Vvii<n) 

1=1  j=l 


and  ¥ (y)  is  the  distribution  of 


- (Xk-1) 
l V 


with  X„,  X-,...  i.i.d.  standard  normal  random  variables. 

■ 2.  o ‘ 

Proof,  (i)  Consider  tne  effect  of  replacing  D (n)  by  on  the  ex- 

pansion (3.19).  Since  ?.  = 0,  S^n)  is  unchanged.  Similarly,  w/^Cfyn)) 
is  unchanged,  so  that  S2(n)  remains  the  same.  The  only  effect  on 
w^2^  (^(n))  is  to  eliminate  the  term  (K^+l)  , but  this  has  no  bearing 

on  the  asymptotic  order  of  Sg(n).  In  fact,  (3.44)  still  holds  (except 

that  B’.'.(.)  = 0 for  i / j)  so  that  the  effect  on  Su(n)  is  asymptoti- 
13  4 -v 

cally  negligible.  In  summary,  the  asymptotic  distribution  of  fr(n)  is 
the  same  as  that  of  W(n)  under  the  null  hypothesis. 

(ii)  The  proof  of  this  result  appears  at  the  end  of  Appendix  B.  QED 


Corollary  3,4.  Assume  (3.56)  through  (3.60)  hold. 

(i)  Assume  (3.61)  through  (3.63)  hold  with  p(n,en)  replaced  by 

1-pn  e (T(x) ) 2 

Pl(n,en)  = P(n,en)  + ( / nt(x)  “ dx>  • 
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Then, 


, 5/2  3/2  „ 

<^>  (Sf(n)  - y(n) ) 

L 

n 


1/2 

converges  in  law  to  $(x  + (27/2)  d*)  as  n + °°. 

(ii)  Assume  (i)  and  (ii)  of  Corollary  3.1  hold  with  p*(n,e  ) replaced 


kL 


1_Pn  e (T(x))  2 

P*(n,en)  = p*(n,en)  + { / nt(x)-  dx}  . 

pn 

Then,  n(l  - W*(n))  - a**  converges  in  law  to  T^(y-d*)  as  n ■»  °°. 

Proof,  (i)  The  change  in  the  metric  from  p to  p^  is  easily  verfied. 
Since  (3.64)  remains  valid,  the  remainder  of  the  proof  of  lemma  3.10 
applies  and  the  result  follows,  (ii)  The  change  in  the  metric  from  p* 
to  p*  Is  again  easily  verified.  The  proof  of  corollary  3.1  appli.es  to 
the  linear  and  higher  order  terms  in  the  expansion  of  W::(n)  and  the 
result  follows.  QED 


a. 


Thus,  the  power  of  the  test  based  on  W(n)  is  strictly  greater  than 
the  power  of  the  test  based  on  Vif(n).  We  cannot  make  such  a strong  state- 

'V 

merit  about  the  power  of  the  tests  based  on  W*(n)  and  Sf*( n),  but,  in 


general,  W*(n)-tests  are  more  powerful  and  often  hyper-efficient  with 


<v. 


respect  to  W*(n)-tests. 


3.6  Examples 

Example  3.1.  Let  H (x)  = (1-6  )<f(x)  + 6 $(x+X  ),  where  0 < 6 <1, 

n n n n n 

3 /4  -1 

and  take  L = n (log  1/p  ) . From  assumption  (3.4),  this  is  "almost" 


127 


as  small  as  L can  be  made.  Finally,  let  X = p . Then 
n J ’ n *n 


en(x)  = 5n^^x+pn^  " 


Gn(?  ^Cy ) ) = <5n($($  1(y)+pn)  - y)' 


Using  a Taylor  series  expansion  and  rearranging  terms, 


$(<!>  1(y)+Pn)  = y + pn<K<i>_1(y)) 


, ,Pn  PnNA-l,  \ . Pn/A-1/  ^2 

i - — - (—  - -g)*  (y)  + -g(*  (y)) 


2§-(«>_1(y)  )3  + 0(p^(log  1/Pn)4) 


so  that 


en($  1(y ) ) 
<f>(i>'1(y))  Pn  n 


, Pn  ,Pn  PnN^-l,  . Pn,A-l,  ^2 

1 - -g  - (~2  - -g-)*  (y)  + -g(*  (y)) 


-2^-(^_1(y ) )3  + o(p^(iog  i/Pn>4) 


Thus,  both  (A)  and  (B)  of  corollary  3.2  are  satisfied,  and 


p(n,Gn)  = P*(n,en)  = 2pn(pn5n;  (l+o(l)). 


Also, 


pi(n,Gn)  = pi^n’en^  = (Pn5n) 


Finally,  for  purposes  of  comparison,  the  Kolmogorov- Smirnov  distance  is 


given  by 
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sup  |Hn(x)  - <i>(x)|  = sup  | Gn(x)  | = ( 2u ) 1/,?'pn6n. 

-03<X<»  -»<X<°0 

i 

The  conditions  yielding  non-trivial  powers  and  the  powers  themselves  for 
the  various  tests  we  have  considered  are  summarized  in  Table  3.2.  Let 
"An  « B^"  signify  that  the  sequence  of  tests  based  on  the  statistics 
{An}  in  hyper-efficient  with  respect  to  the  sequence  of  tests  based  on 
the  statistics  {Bn>.  Then,  for  the  sequence  of  alternatives  considered 
in  this  example, 

A A _ 

TCnje^Gj)  « W(n)  « W(n)  « W*(n) 

where  D denotes  the  two-sided  Kolmogorov-Smirnov  statistic. 


Example  3,2.  We  again  let  H (x)  = (1-6  )$(x)  + 6 $(x+p  ),  except  this 

n n n n r 

time  we  choose 


L 

n 


.8 


= n 


_ n-(log  n)“2/5 


Feasibility  conditions  (3.2),  (3.3),  and  (3.4)  are  easily  verified  for 
these  functions.  Using  results  in  the  proof  of  corollary  3.2,  we  have, 
neglecting  smaller  order  terms, 

P(n’En>  = - W«V"><106  Vp„))\«n  + 2pn5j. 


Note  that 

pn6n=  o((Ln/n)2(log  l/pn)2pn6n)  iff  6n=  o( (Ln/n)2(log  l/pn)2), 


and 
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TABLE  3.2 

ASYMPTOTIC  POWERS  UNDER  H (x)  = (1-6  )<t>(x)  + 6 <Kx+p  ) 

n n n rn 


Sequence  of  tests 
based  on 


Condition  yielding  non- 
trivial power 


Power  of  level-a  test 


W(n) 


fr(n) 


W"(n) 


'V, 

W*(n) 


T(n;91,62) 


n1/2p9/4(i°g  l/pn)6n  = dx 


n^'Pn^dog  1/Pn)«n  = *2 


1/23/2.  _ . 

n p O’*  do 
rn  n d 


n1/2p  6 = d 

*n  n 4 


n1/2p«  -d, 
rn  n 5 


T 1/41/4  . _ . 

L n p 6 - d. 

n n n 6 


*(c.  + '54)1/2d2) 

1 


+ (27/2)1/2d2) 


1 - ‘ V 


1 - V0»1,l-Cl  ' V 


ft 


$(cA  + 1. 99d_ ) 
4>,a  o 


F.a 


where  c„  is  defined  by  J dF(x)  = a 
F.a  * 

7 rrt 


ft 


closed  form  expression  does  not  exist 
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5/23/2  t2,.  w n2 

n pn  L(logl/p) 

L“  —2 2 >>n5n  " ” 

n-xo  L n 

n 


implies  that 


-1/2  -5/2,,  , # Ln(log  1/pn)  N 

0(n  pn  (log  l/pn)  ) = o( 2 ) 

!.,  ,1  jj 


In  Figure  3.3,  we  use  the  above  facts  in  conjection  with  assumption  (3.61) 

to  plot  the  power  of  the  ?/-test  as  a function  of  the  rate  at  which  Sn 

*“1/2  -5/2  **2 

approaches  zero.  When  6^  = consfn  p^  (log  1/p^)  > p(n,en)  < 

and  any  power  between  0 and  a is  obtainable  by  choosing  the  constant 

2 2 

appropriately.  For  6n  = const- (Ln/n)  (log  l/pn)  > any  power  between  0 
and  1 is  obtainable  by  choosing  the  constant  to  solve  the  appropriate 
quadratic  equation. 

Figure  3.3  illustrates  a rather  interesting  phenomenon — a test  which 
is  both  biased  against  a sequence  of  contiguous  alternatives  and  con- 
sistent. As  alternatives  approach  the  null  hypothesis  at  a very  fast  rate, 
the  test  cannot  discriminate  between  the  alternatives  and  the  hypothesized 
distribution,  and  the  power  is  equal  to  the  level  of  the  test.  As  the 
rate  of  approach  slows  down,  the  test  can  discriminate,  but  because  of 
the  bias,  the  power  decreases  rather  than  increases.  Eventually,  if  the 
rate  of  approach  slows  down  enough,  the  consistency  property  of  the  test 
takes  over  and  the  power  increases  to  one.  What  is  most  interesting  is 
that  the  rates  at  which  the  bias  sets  in  and  the  consistency  sets  in  are 
different,  resulting  in  a set  of  alternatives  which  are  never  detected  by 
the  test. 


The  author  knows  of  no  analogs  to  the  above  behavior  in  the  literature. 


■-  z* 
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Bias  has  been  exhibited  for  both  the  Kolmogorov- Smirnov  test  (Massey  [183 ) 
and  the  Cramer-von  Mises  test  (Thompson  [36]).  However,  in  each  jase,  the 
bias  was  proved  only  for  a fixed  alternative  and  fixed  n finite;  to  our 
knowledge,  analysis  similar  to  the  above  in  which  contiguous  alternatives 
are  considered  has  not  been  reported. 

3.7  Concluding  Remarks 

There  are  at  least  three  results  above  which  deserve  further  emphasis 
in  that  they  suggest  possible  areas  for  future  research. 

First,  the  large-sample  results  above  for  the  tests  based  on  a gradu- 
ally increasing  number  of  order  statistics  stand  in  contrast  to  the 
empirical,  small-sample,  results  obtained  for  the  analogous  statistics 
based  on  all  n order  statistics.  On  the  basis  of  the  empirical  studies 
cited  in  section  3.1,  the  consensus  in  the  literature  has  been  that  in 
using  the  Shapiro-Francia  test,  an  approximation  to  the  Wilk-Shapiro  test, 
one  gains  computationally  but  pays  at  least  a small  price  in  terms  of 
sensitivity  to  alternative  distributions.  Above,  we  found  that  it  was  ft 

that  approximated  W«,  and  not  vice  versa,  in  the  sense  that  the  errors 

. . . . 'v 

in  the  approximation  controlled  the  asymptotic  distribution  of  W and 

.... 

caused  the  associated  test  to  be  inferior  to  the  W-'-test  in  discriminating 
contiguous  alternatives.  In  section  3.2.4,  we  mentioned  a possible 
approach  toward  resolving  the  open  question  of  the  relation  between  the 
large-sample  and  small-sample  results. 

^ <v 

# _ 'V,, 

Second,  the  results  for  the  modified  statistics  W and  W5S  suggest 
that  modifying  W and  W*  in  the  same  manner  would  lead  to  superior 
tests.  It  would  be  useful  to  carry  out  empirical,  small- sample,  studies 
on  such  tests.  In  view  of  the  widespread  acceptance  of  the  W-  and 
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and  W*-tests,  it  would  be  of  considerable  practical  value  if  such  studies 
should  yield  results  which  agree  with  the  large-sample  results  obtained 
here. 

% 

% 

Finally,  the  test  based  on  W*  is  worthy  of  further  study  xn  its  own 

right.  In  example  3.1  we  found  that  even  against  alternatives  which  are 

particularly  easily  discriminated  by  the  Kolmogorov- Smirnov  test  (i.e. 

the  Kolmogorov- Smirnov  distance  is  maximized  at  the  origin),  the  ?/-test 

performed  well.  Since  the  metric  p“(n,en>  heavily  weights  disturbances 

in  the  tails,  it  should  not  be  difficult  to  find  situations  where  the 

f/"-test  is  very  efficient  relative  to  the  Kolmogorov- Smirnov  test,  and 

perhaps  to  the  other  EDF  tests  as  well.  It  would  also  be  interesting  to 

% 

study  the  empirical  behavior  of  the  Sf*-tcst,  but,  as  pointed  out  in 
section  3.2.4,  for  moderate  sample  sizes  the  asymptotic  theory  is  in- 
adequate, and  even  the  distribution  of  W*(n)  under  the  null  hypothesis 
would  have  to  be  computed  empirically. 


APPENDIX  A 


MONTE  CARLO  RESULTS  FOR  THE  WEISS  STATISTIC  UNDER  HQ 


In  this  appendix  we  describe  the  results  of  a Monte  Carlo  study  of 

A A 

the  distributions  of  the  statistics  TCnjS^jG^)  and  T(n;0^,02)  (defined 
in  section  2.2.2)  under  the  null  hypothesis  of  normality . 

For  each  of  the  sample  sizes  n = 100,  n = 200,  and  n = 300,  we 
consider  three  choices  of  the  functions  Ln,  K^,  and  p^.  These  choices  are 
labeled  cases  I,  II,  III  as  defined  below. 


1-6 


x-o  -1/4 

Case  I:  Ln  = greatest  integer  < n where  6^  = (log  n) 

2 

6 -6„ 

. nr,  _ nn 

K = greatest  even  integer  <_  n LI  - 2n  J 


p = (n  - K L )/2n  . 

*n  n n 


Case  II:  L = greatest  even  integer  <.  n where  6 = .7 
n — 

p^  taken  as  close  as  possible  to  the  value  in  case  I such  that 
np^  is  an  integer  and  Kn  = 2 


K = (n  - 2np  )/L 
n *n  n 


Case  III:  same  as  case  II  with  6 = .3 


Since  we  were  particularly  interested  in  studying  different  choices 

of  L , the  values  of  p were  made  as  uniform  as  possible  in  order  to 
n’  rn 

minimize  the  effect  of  moving  into  the  tails  at  different  rates.  The 
values  of  the  functions  for  the  different  cases  and  sample  sizes  are  sum- 
marized in  Table  A.l. 
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TABLE  A.l 

PARAMETER  SETTINGS  FOR  MONTE  CARLO  STUDY 


Parameter  I II  III  I II  III 


4 24  39  6 40  69 

16  3 2 26  4 2 

.18  .14  .11  .11  .10  .115 


II  III 


54 

95 

4 

2 

.14 

.18 

For  each  case  and  value  of  n,  100  sets  of  n N(2,4)  random 
variables  were  generated  using  the  Box-Mueller  method,  and  the  statistics 

A A 

T(n;0^,02)  and  TCn;©^,©^  were  computed.  The  ordered  values  of  the 
statistics  are  plotted  on  normal  probability  paper  in  figures  A.l 
through  A. 6.  Each  figure  contains  the  results  of  cases  I,  II,  and  III 
for  a specified  value  of  n,  the  odd  numbered  figures  when  0^  and 
©2  were  specified,  the  even  numbered  figures  when  they  were  estimated 
from  the  samples.  Deviations  from  linearity  in  the  figures  correspond 
to  deviations  from  normality  of  the  empirical  distributions  of  the 
statistics. 

The  study  was  admittedly  limited  in  scope;  those  conclusions  we 
felt  were  valid  are  discussed  in  section  2.4.1. 


FIGURE  A.l 

NORMAL  PLOT  OF  ORDERED  T(100;6  ,0  ) VALUES  F 


FIGURE  A.  2 

NORMAL  PLOT  OF  ORDERED  T(1OO;0  ,0  ) VALUES  FOR  CASES  I(®) 
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FIGURE  A.  3 

NORMAL  PLOT  OF  ORDERED  T(2OO;0_ ,0.)  VALUES  FOR  CASES 


FIGURE  A. 6 

NORMAL  PLOT  OF  ORDERED  T(300;$  ,§  ) VALUES  FOR  CASES  I(®) ,11(0) ,III(*) 
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APPENDIX  B 
PROOF  OF  THEOREM  3.2 


Throughout  this  appendix,  we  assume  that  Y^(n) , . . . ,Yn(n)  are  the 

order  statistics  of  a sample  from  a standard  normal  population.  Suppressing 

the  dependence  on  n,  we  let  ^ denote  the  (K  +l)-vector  with  elements 

n 

'll . = Y ,.  ,,r  . and  let  Z denote  the  (K  +l)-vector  with  elements 
i npn+  (i-l)L  ’ n 

- v/nt.(n)(?.  - T.(n)),  where  n,  p , L , t.(n),  and  T.(n)  are  as 
l x xx  ’ *n  n i x 

defined  in  section  3.2.1.  The  distribution  of  ^ is  asymptotically 

indistinguishable  from  the  (K^+D-variate  normal  distribution  with  mean 

vector  0 and  covariance  matrix  £(n)  is  given  by  (2.23). 

Before  proving  the  theorem,  it  is  necessary  to  establish  a linear 

a. 

transformation  which  can  be  used  to  express  Z^  as  the  weighted  sum  of 

K +1  i.i.d.  standard  normal  random  variables, 
n 

B.l  A Linear  Transformation 

Define  U = (U  ,U  , . . . ,U  . ) , where  U. , 1 < i < K +1,  are  i.i.d. 

J-  c.  i\  +1  x n 

n 

N(0,1)  random  variables,  and  let  A denote  the  (K  +l)x(K  +1)  matrix 

n n 


with  elements 


r 


a. . = 
13 


yfl 

i=j=l 

1 

i=j=2,3...,Kn 

1+r 

i=j=K  +1 
J n 

-1 

(i-l)=j=2,3,. 

1 

H-* 

to 

i-l;j=Kn+l 

r 

i=  2 , 3 , . . . , 

0 

otherwise 

,K 


n 


where 


% = 


°Pr 


and 


r = - 


L (1+p  “1/2) 
n rn 

n(l-pn)  ~ 


A. 
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It  is  easily  verified  that  A'A  = £^(n)  (where  £.^(n) 


(n(Ln~l)/L^)^(n)  is  given  explicitly  in  (2.27)).  Thus,  letting  B 


2 -1/2  -1 

(n(L  -1)/Ln)  A , the  random  vector  Z*  = BU  has  the  same  asymptotic 


distribution  as  ^ and  can  be  used  in  place  of  for  all  asymptotic 


probability  calculations. 


Let  D^(m)  denote  the  determinant  of  the  m x m matrix 


1 r 


-1  1+r 


where  all  remaining  elements  are  zeros.  Expanding  by  the  first  row. 


D1(m)  = D^m-l)  + r(-l)m  1(-l)m_'1'  = D^m-l)  + r 


= D^(m-2)  + 2r  = ...  = 0^(1)  + (m-l)r  = 1 + mr  . 


Now  call  D(A)  the  determinant  of  A.  Then,  expanding  by  the  first  row 


of  A. 


K K 

D(A)  = /FD..(K  ) + - (-1)  n(-l)  n = /F(l+K  r)  + - 

1 n /*  n n 


(B.l) 


It  is  now  easily  verified  that  the  matrix  B has  elements  b„  given  by 


L 1/2 

bii  ■ 


1+(K  +l-i)r 
n 


/F(l+(Kn+l-i)r)  i > j > 1 


-r((i-l)/T  + — ) i < j 

/F 


(B.2) 


We  note  that  5u  t»«»  "•'anally  spaced"  case,  Z = 1,  and  the  trans- 


formation A reduces  tc.  . ..o^ormation  given  in  Weiss  [38]. 


B.2  The  De  Wet  and  Venter  Theorem 


The  proof  of  Theorem  3 . 2 is  based  on  the  work  of  De  Wet  and  Venter  [9  , 
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10].  For  completeness,  their  key  theorem  is  included  here.  The  notation 
used  in  this  subsection  is  their  notation  and  does  not  refer  to  any 
quantities  defined  elsewhere  in  this  thesis. 

Let  Z^,Z^, . . . be  i.i.d.  random  variables  with  zero  means,  unit 
variances,  and  finite  fourth  moments.  Let  c^jn  Ci,j=l,2, . . . ,n)  be  real 

numbers  with  c^jn  = cjin 

T 

n 

Theorem  (De  Wet  and  Venter  L10J).  Let  lb.  ji=J.,z,...,n;  m=x,z,... 


for  all  i and  j and  set 

= y c..  Z.Z.  . 

.L . inn  1 1 
1*3 


1 a 3 — _ J 


fv  : ms.l-,2 > be  real  numbers;  let  {M.}  be  a sequence  of  integers_ 

'in’  ’ n — 

M 


with  M ; let 


n 


■ l >J  • 


m=l 


Assume  that  as  n 00  , 


max  b.  *»•  0 for  each  m 
, . 1 lmn ' 

l<i<n 


max  |T  b.  b.,  - <5  , | = o(t  1) 

lmn  lkn  mk'  n 


l4k,m<Mn  i 


2 ,2  _ _,„-2 


max  ^ b.  b.. 

l<k,m<M  i 
n 

r 2 r 2 

) c..  •>  r = AY  <°° 

,L . inn  " m 

1 


= o(t  *) 
n 


I J c..b.  b.  - y |=  o(t  1) 
1 .L . inn  lmn  nmn  m n 


max 


l4k,n>4Mn  1,3 


i]n  lmn  3mn 


Then, 


Dist(Tn  - ETn)  * Dist<  l TT.(y.-l)  ) 


with  Y15Y2,, 


. i.i.d.  N(0,1). 
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The  assumptions  of  the  theorem  appear  quite  formidable  to  verify. 
However,  as  De  Wet  and  Venter  point  out,  it  is  often  the  case  that 

2:  n ^cCi/n+l,  j/n+1) 

where  c(*,*)  is  measurable,  square- integrable  on  the  unit  square,  and 
symmetric  in  its  arguments.  Then,  letting  {g^}  and  {y^}  denote  the 
eigenfunctions  and  eigenvalues  of  c and  supposing  (gm)  is  an 


orthonormal  system  on  (0,1),  selecting  b^mn  = n 


■1/2 


g (i/n+1)  will  often 
m 


satisfy  the  assumptions.  Verification  involves  showing  that  the  differences 
between  the  various  sums  and  the  corresponding  integrals  are  sufficiently 
small. 


B.3  Proof  of  Theorem  3.2 

We  consider  the  Taylor  series  expansion  of  W*(n)  analogous  to  the 

expansion  (3.19).  It  is  easily  seen  that  the  constant  term  in  the  new 

expansion  is  identically  equal  to  1;  the  linear  term  is  identically  equal 

to  zero,  and  the  remaining  terms  are  asymptotically  exactly  the  same  as  in 

% 

the  expansion  (3.19)  (with  T^(n)  replaced  by  T^(n)  since  qn  = 1). 

Thus,  as  proved  in  lemma  3.7,  the  cubic  and  higher  order  terms  (i.e.  S4(n)) 
are  of  smaller  order  in  probability  than  the  quadratic  term  as  n <*>  , 
and  the  asymptotic  distribution  of  (1  - W*(n))  is  thus  the  same  as  the 
asymptotic  distribution  of  (-S2(n))  given  by 


K +1  K +1 
n n 


I I 

i=l  j=l 


[—  T.  (n)T.  (n)  + (K  +1)"1 
nii  n 


6 • • ][Y .-T.  (nj.lCY. -T.  (n)].  (13.3) 
ID  i i D D 


We  define  the  functions  {£„;  i,j=l,2, . . . ,Kn+^}  by 


r 


hi  ■< 


1 “ (Pn  + (i-D-J-)  i 1 j 


-(pn  + (i-l)-J-  ) 
n n 


i < j 


Using  the  definitions  together  with  the  transformation  (B.2),  tedious  but 


straightforward  calculations  yield  the  result  that 


T K +1  K +1 
L n n 


-nS2(n)  = 


— 7 I c. .(n)U.U.  + R(n) 

n i=l  j=l  13  13 


(B.  4) 


where 


c. . (n)  = c^}  (n)  - c[V  (n)  - c^}  (n) 
13  13  13  13 


, K +1  . p 
L n t,,  . 

(„)  , -2.  I 

13  k=l  t2  Cn) 

t2  K +1  K +1  r _ 

cC2)  („)  = — ny  ny  ki ^3 

13  n2  k=l  m=l  Vn)Vn) 

h n n T.  (n)T  (n) 

UJ  /\_nr  V k m _ _ 

°ij  ' n Jl  m=l  tk(n)tr„<n)  ki  "J 


1/2  2 

and  R(n)  is  o (p  (log  1/p  ) ) as  n 00 * In  writing  the  coefficients 

P n n 

2 2 2 

for  the  above  sums,  we  have  used  the  fact  that  D (A)  = £r  Kn(iro(l)) 
as  n ->■  00  . 

f\, 

The  asymptotic  expected  value  of  N(l-WiV(n))  is  obviously  a*, 

which  was  shown  in  (3.41)  to  be  o(log  1/p  ) as  n -*•  00  . Futhermore, 

n 

the  expression 


. K +1  K +1 
L n n 


- I l c. . (n)U.U. 
n i=l  j=l  13  13 


in  (B. 4)  above  is  completely  analogous  to  expression  (50)  of  De  Wet  and 
Venter  [9]  with  their  replaced  by  5^,  except  that  the  {U^} 

here  are  normally  distributed,  whereas  the  {Z^}  in  their  expression  (50) 
are  exponentially  distributed  with  mean  zero  and  variance  1.  However,  the 
theorem  stated  in  section  B.2  depends  only  on  the  independence  and  certain 
moment  conditions  of  the  variables,  all  of  which  are  satisfied  in  either 
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case.  Hence,  their  derivation  of  the  distribution  of  their  statistic 

C9jPP-  144-5]  and  [10, pp.  385-6]  immediately  implies  that  n(l-W*(n) ) - 
a*  is  asymptotically  distributed  as 


00 


l 

k=3 


1 


where  are  i.i.d.  standard  normal  random  variables.  This 

completes  the  proof  of  theorem  3.2. 

% 

In  section  3.5  we  introduced  the  statistic  W*(n).  Extension  of  the 
above  proof  to  the  distribution  of  this  statistic  requires  further 
elaboration  of  the  method  of  De  Wet  and  Venter.  Set 

c(x,y)  = c(1)  (x,y)  - c(2)  (x,y)  - c(3)  (x,y) 

with 


V J 
i ‘ 


j dz 

0 t (z) 


.(2) 


c(3)  (x,y)  = J J f[z7t(v)  5(z>xK(v>yMzdv  = |g2(x)g2(y) 


1 1 

J J 

0 0 

1 1 


(x’y)  = 1 1 -ffz)t(v)y)  dxdv  = gi(x>g2(y> 


T(z)T(v) 


0 0 


. where 

>• : 


p* 

[ 1-x 

if  x > y 

5(x,y)  = < 

( 

1 -* 

if  x < y 

] and  g^(x)  = H^(T(x))  with  H^(*)  the  k-th  normalized  Hermite  polynomial. 

The  above  functions  are  the  continuous  analogs  of  the  coefficients  in  (B.4). 

! ’ sequences  {g^jg^,...}  and  (1,1/2, 1/3, . . . } are  the  eigenfunctions  and 

j eigenvalues  of  c(1)(x,y),  so  that  {gg,g4,...}  and  (1/3 ,1/4, . . . } are  the 
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eigenfunctions  and  eigenvalues  of  c(x,y).  This  fact  leads  to  the  choices 


b.  = g (p  +(i-l)L  /n) 
lmn  m n n 


Y_  = 1/m 

m 


m = 3,4 , , 


in  the  De  Wet  and  Venter  thoerem,  and  the  result  of  Theorem  3.2  follows  for 
fr*(n). 

For  ft*(n) , expression  (B.3)is  unchanged  except  that  the  term 
<Vir  is  missing.  This,  in  turn,  implies  that  (B.4)  is  unchanged  except 
that  c„(n)  = (n)  - (n)>  ^be  terms  corresponding  to  cf?^  (°) 

vanish.  Thus,  in  this  case,  c(x,y)  has  eigenfunctions  {goSgo,...}  and 

2 o 

*\j 

eigenvalues  {1/2, 1/3, . . . }.  It  follows  that  n(l-ft*(n))  - a**  is  asymptoti- 
cally distributed  as 


~ K - l 

i -V- 

k=2  k 


where  X^,X^,...  are  i.i.d.  standard  normal  random  variables.  This  proves 
part  (ii)  of  corollary  3.3.  We  named  the  above  distribution  (y).  Tables 
of  this  distribution  are  not  available  but  can  be  created  by  numerically 
inverting  the  characteristic  function  as  in  [9]. 


APPENDIX  C 


n 

> I 

i 

i 


1 AUXILIARY  FORMULAS 


In  this  appendix,  we  collect  for*  reference  purposes  various  formulas 
concerning  functions  of  $ ^(x)  which  were  required  in  chapters  2 and  3. 
We  also  illustrate  the  formal  procedure  for  approximating  summation  by 
integration  and  obtain  upper  bounds  on  the  errors  of  such  approximations. 


C.l  Derivatives,  Integrals  and  Limits  for  Functions  of  $ ^(x) 

')  Derivatives . Let  T(x)  denote  $ ^(x),  t(x)  denote  t(>(T(x)),  and 

\>  f(x)  denote  t(x)T(x).  Then  the  first  four  derivatives  of  f(x)  are 

i X 

given  by 

r ; 

\ ■ ' 

f(1)(x)  = 1 - T2(x) 
f(2)(x)  = -2T(x)/t(x) 

(C.l) 

f(3)(x)  = -2(1  + T2(x))/t2(x) 
f(4)(x)  = -4(T3(x)  + T(x))/t3(x). 

It  is  easily  verified  by  induction  that  as  x approaches  0 or  1,  we 
have 


f'  '(x)  = —2 • (k— 2) IT  (x)/t  (x)  + (smaller  order  terms) 


for  k >.  2. 

Integrals . The  analysis  in  chapter  3 required  evaluation  of  expressions 
of  the  form 


to 

i; 
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! 


s i 


> ! 


* 1 


I 

I 


Combining  the  above,  the  integral  in  (C.2)  can  be  written 


1 . 1— k.  \ ._k— 1/  \ 

j^j-t  (x){T  (x) 


|*k  i-j 

+ l ( 1} j t(k+l-2j )(k+3-2j ) - - • (k-1) }Tk-2j-l(x) } |b(n) 
j=l  (k-l)j  a(n) 


+ | (l+(-l)k) 


1-3-5- - -(k-1) 


(k-1) 


k/2 


b(,n)  -k 

/ t K(x)dx. 

a(n) 


(C.5) 


Note  that  is  a(n)  0 and  b(n)  1 as  n -*•  00 , then  the  term 

1 Tk-I(x)  |b(n) 
fc_1  tk'1(x)  a(n) 

dominates  the  above  expressions.  Also  note  that  if  k is  odd  and  a(n)  = 
l-b(n),  then  (C.2)  is  zero  by  symmetry.  Finally,  from  (C.4), 

/ t 2(x)dx  = o(t  1(a(n))  + t b(b(n))) 
a(n) 

= o([a2(n)log  l/a(n)]  1/,2+[(l-b(n))2log  l/(l-b(n))] 

(C.  6) 


if  a(n)  -J-  0 and  b(n)  1 as  n -»■  °°,  where  the  last  line  follows  from 
(C.10)  below. 

Limits . Using  the  fact  [13 ,p.  175]  that 

1 - $(x)  ^ as  x ->  00 

x 
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and  taking  x = <J>  x(y)  (and  y + 1),  we  obtain 


T(y) 

t(y) 


as 


y l . 


The  following  are  also  easily  verified: 


Lim 

y->0 


T2(y) 

log(l/y) 


Lim 

y-K) 


t(y)T(y) 

ylog(l/y) 


= 2 


= -2 


Mm i&L 

y-K)  y(log  1/y) 


1/2 


= /2  . 


(C.7) 


(C.8) 


(C.9) 


(C.10) 


The  corresponding  limits  as  y 1 are  obvious. 

C.2  Approximating  Summation  by  Integration 

For  concreteness,  we  formalize  the  procedure  used  throughout  chapters 
2 and  3 using  two  expressions  which  are  representative  of  those  appearing  in 
chapter  3. 

*1  1c 

(A)  Let  ij>(x)  denote  the  function  TJ(x)/t  (x),  where  k is  a positive 
integer  and  j is  an  even  integer  >.  k.  Also  let  a ^ denote  the  quantity 
Pn  + (i-DL^/n,  and  set  i*  = greatest  integer  <.  Kn/2.  Let  S(n) 
denote  the  sum 


L 

n 

n 


n 

l 

i=2 


-Kani} 


~1 
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and  let  I(n)  denote  the  integral 


/ ij/(x)dx 


Then,  S(n)  - I(n)  = E^n)  + E2(n)  + Eg(n)  where 


L i-' 


E-^n) 


= T I t(a.)  - J ^(x)dx 


n i=2  nl  i 


r K 
L n 


E2(n) 


E3(n) 


~rt  i Mani>  ' / *<*M* 

i=i^l  an(i*+l) 

an(i*+l) 


/ t|)(x)dx  . 


Using  the  fact  that  iJj(x)  is  monotonically  decreasing  on  (0,1/2), 


1“  a . 
r ni 


|E  (n)|  4 l r |«(ani)  - <Kx)|dx 

is9  a * 


1=2  an(i-l) 


• cl  • 

i*  m 


il  I l«ani>  - «a„(i-l))ldx 


1=2  an(i-l) 


L * 

~n  j2  <Wan(i-l))  ' Kani)) 


T (*(anl)  - ,l(ani*)) 


/ 


= °<f> 


P~  (log  1/Pn)''" 


A symmetric  argument  applies  to  | E^( ) | . Clearly,  Eg(n)  0(L^/n) . 

Thus  |s(n)  - I(n)  I is  0((L  /n)p"k(log  1/p  )^"k)/2).  From  (C.5),  I(n) 
1 1 n n n 

is  0(p^  k(log,  l/pn)^  k^).  Hence, 

S(n)  = I(n)(l  + 0(Ln/npn)). 


(B)  Let  S(n)  denote 


K 


L n e (T(a  .))  2 
n v r n nx  n 

it  c tTa„:r"] 


i=2 


ni 


and  let  I(n)  denote 


Then, 


1_pn  e (T(x))  2 

/ C~ 


t(x) 


-]  dx  . 


■ n 


K a . 
n ni 

|S(n)  - I(n) | < l j 

i=2  an(i-l) 


e (T(a  .))  2 e (T(x))  2 


|C^ 


ni 


t(a  . ) 
nx 


-]  - [- 


t(xT 


-]  |d> 


Since 


e (T(x+(L  /n)))  2 

r n-  IJ 1 

L t(x+(Ln/n))  J 


(T(x) ) 
n 

t(x) 


+ 2 


L ecno) 

n n 

n t(0 


e^(T(C)) 

t2(C) 


e (T(5))  T(5) 

_£ i 

t(5)  t(5) 


for  some 


C,  x < 5 < x+(L^/n) , 


(3.62)  and  (3.63)  imply  that 


L L 

| S(n ) - I(n)  | = o(2  (log  1/p  ) + 2 —— 

nb^(n)  n : nb^(n) 


K 


n T(a  .) 


1=2 


m 


D 


L 

= o( — £ — (log  1/p  )) 
nD  (n)  n 


where  b(n)  is  as  defined 
expression  is  o(p(n,en)). 
of  Chapter  2 using  (2.41). 


in  (3.62)  and  (3.63).  In  particular,  the  latter 
Similar  results  may  be  obtained  for  the  integrals 
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